GALECTIN EXPRESSION IS INDUCED IN CIRRHOTIC LIVER AND HEPATOCELLULAR CARCINOMA

FIELD OF THE INVENTION 

The present invention relates to the discovery of a marker for liver disease. Novel diagnostics, prognostics,
therapeutics and methods of use of the foregoing for the treatment and prevention of hepatocellular carcinoma are
also disclosed.

BACKGROUND OF THE INVENTION 
Hepatocellular carcinoma (HCC) is a major type of cancer causing a quarter of a million deaths worldwide each year
(Kew, M.C., Tumors of the liver. In: D. Zakim and T.D. Boyer (eds.), Hepatology. A Textbook of Liver Disease, pp.
1206-1240, W.B. Saunders, Philadelphia (1990)). While various factors have been identified as causes for HCC, the
major established factors are infections with hepatitis viruses B (HBV) and C (HCV). Various potential mechanisms exist
whereby infection by HBV can result in HCC. These include both chronic liver injury caused by cytotoxic T cell
responses to infected hepatocytes and intracellular occlusion resulting from expression of viral protein (Chisari, F.V.,
Analysis of hepadnavirus gene expression, biology, and pathogenesis in the transgenic mouse. In: Current Topics in
Microbiology and Immunology, pp. 85-99, Springer-Verlag, Berlin (1991)).
Deregulated expression of a number of genes, such as tumor suppressor genes, and altered activity
of other cellular genes have been associated with HCC (Robinson, W.S., J. Gastroenterol. Hepatol., 8:95 (1993)).
Transactivation by HBV-X protein is also considered to significantly contribute to cell transformation. This protein has
been shown to transactivate several viral and cellular genes such as c-jun, c-myc, β-interferon and class I
histocompatibility complex (Robinson, W.S., J. Gastroenterol. Hepatol., 8:95 (1993)). The transcription factors CREB
and ATF2 and various cis-acting DNA elements that regulate cellular genes, such as κB, AP1 and AP2, are also
affected by HBV-X (Robinson, W.S., J. Gastroenterol. Hepatol., 8:95 (1993)). Although significant strides have been
made toward understanding the onset of HCC, the need for sensitive diagnostics, prognostics, and therapeutics for
the treatment and prevention of HCC is manifest.

BRIEF SUMMARY OF THE INVENTION 
Aspects of the present invention relate to the discovery that hepatocytes in liver biopsies from subjects with
hepatocellular carcinoma (HCC) have significant levels of galectin-3 (76% immunohistochemically positive) while
galectin-3 expression is low or absent in normal hepatocytes. Notably, the correlation of galectin-3 expression and
HCC is independent of whether the subject had prior hepatitis B virus infection (14 of 18 HCC cases from HBV+
patients, and 5 of 7 cases from HBV- patients demonstrated positive galectin-3 immunohistochemistry). This
indicates that galectin-3 expression may be used to detect HCC resulting from a variety of factors. However,
co-transfection studies, using a galectin-3 promoter construct and an HBV-X protein (HBV-X) expression vector,
demonstrated that galectin-3 expression can occur through transactivation of the galectin-3 promoter by HBV-X.
Additionally, aspects of the present invention relate to the discovery that galectin-3 is abundantly expressed in
cirrhotic liver in a peripheral distribution within regenerating nodules. The galectin-3 expression in rapidly proliferating
hepatocytes in cirrhotic liver can be used to identify an early neoplastic event.

BRIEF DESCRIPTION OF THE DRAWINGS
FIGURE 1 shows the results of two chloramphenicol acetyl transferase assays in which the transactivation 
of the galectin-3 promoter by co-transfection with an HBV-X protein expression vector is demonstrated. 
FIGURE 2 shows the cDNA sequence of human galectin-3. 

DETAILED DESCRIPTION OF THE INVENTION

In the present invention, several embodiments relate to the discovery that hepatocytes in liver biopsies from
subjects with hepatocellular carcinoma (HCC) have significant levels of galectin-3 (76% immunohistochemically
positive), whereas galectin-3 expression is low or absent in normal hepatocytes. In addition, galectin-3 expression in
HCC is independent of whether the subject had prior hepatitis B virus infection (14 of 18 HCC cases from HBV+
patients, and 5 of 7 cases from HBV- patients demonstrated positive galectin-3 immunohistochemistry). Furthermore,
co-transfection of a galectin-3 promoter construct and an HBV-X protein (HBV-X) expression vector induces galectin-3
expression through transactivation of the lectin promoter by HBV-X. Additionally, galectin-3 expression in
regenerating nodules is also associated with cirrhosis of the liver. Herein we provide novel diagnostics and
prognostics for determining whether a subject is suffering from HCC or cirrhosis of the liver or is likely to develop HCC
or cirrhosis of the liver. Therapeutic agents or vaccine components for the treatment and/or prevention of HCC or liver
cirrhosis are also provided.

Galectins are a family of proteins characterized by sequence homologies in a domain which exhibits
galactose-specific carbohydrate binding activity. There are presently 12 characterized eukaryotic members, and
currently a total of two dozen members have been predicted by homology from DNA sequences present in databases.
Several dozen human members can be anticipated from projections based on the number of similar members present in
the completely sequenced Caenorhabditis elegans genome. Galectins do not contain traditional sequences that specify
membrane translocation, but are both secreted and located intracellularly. Of the characterized galectins, galectins-1
and -3 are the most extensively studied (Barondes, et al., J. Biol. Chem., 269:20807 (1994); Kasai, K. and Hirabayashi,
J., J. Biochem. (Tokyo), 119:1 (1996)).
Galectin-3, previously designated as εBP, CBP35, Mac-2, L-29 and L-34, has been associated with assorted
processes such as cell growth, tumor transformation, and metastasis (Moutsatsos, et al., Proc. Natl. Acad. Sci. USA,
84:6452 (1987); Raz, et al., Cancer Res., 46:3667 (1986); Raz, et al., Int. J. Cancer, 46:871 (1990); and Hsu, et al.,
Am. J. Pathol., 148:1661 (1996)). This lectin of approximately 30,000 Da is composed of two domains: a
carboxyl-terminal domain that contains the carbohydrate-binding region, which binds saccharides with terminal galactose
residues, and an amino-terminal domain consisting primarily of tandem repeats of nine amino acids (Liu, et al.,
Biochemistry, 35:6073 (1996)). Galectin-3 is expressed in various tissues and organs and is present in dendritic cells
and a variety of normal cells of epithelial origin but is undetectable in normal lymphocytes and hepatic cells.

One aspect of the present invention relates to methods of determining whether an individual is suffering
from HCC or cirrhosis of the liver, or is likely to suffer from HCC or cirrhosis of the liver, comprising determining
whether the level of galectin-3 expression in the subject's hepatocytes is above normal. Example 1 describes the
immunohistochemical characterization of galectin-3 expression in samples from normal liver, subjects with cirrhosis of
the liver, and subjects with HCC. Pathological specimens were obtained from archives at the Medical Center of the
University of California, San Diego, or the Veterans General Hospital, Taichung, Taiwan. Cases studied included
normal liver (six cases), cirrhotic liver (eight cases), hepatocellular carcinoma (25 cases), fibrolamellar tumor (two
cases) and hepatoblastoma (one case).

EXAMPLE 1 

The expression of galectin-3 in normal human liver and cirrhotic human liver biopsy tissue was analyzed by
immunohistochemistry. The normal liver biopsies were obtained from normal subjects or from regions in HCC patients
unaffected by tumor. Tissues were fixed in either paraformaldehyde (for samples probed with rabbit antibody) or B5
(for samples probed with monoclonal antibody) and embedded in paraffin. Tissue sections were processed for
immunohistochemical detection of galectin-3, as described previously (Konstantinov, et al., Am. J. Pathol., 148:25
(1996)). In brief, primary antibodies used were affinity-purified rabbit anti-galectin-3 antibody or mouse monoclonal
anti-galectin-3 antibody A3A12 (Liu, et al., Biochemistry, 35:6073 (1996)), both at 10 μg/ml. Secondary antibodies
were goat anti-rabbit peroxidase and goat anti-mouse peroxidase conjugates, respectively. Antibody controls used
were normal rabbit immunoglobulin and isotype-matched irrelevant monoclonal antibodies, and did not result in any
staining. Chromogenic substrates were 3,3'-diaminobenzidine or 3-amino-9-ethylcarbazole.

A total of four normal samples were probed with affinity-purified rabbit polyclonal anti-galectin-3. Positive
reactivities were demonstrated by brown precipitates (peroxidase-diaminobenzidine) and counterstaining was done
with hematoxylin. Prominent staining was noted in epithelial cells lining the bile ducts and in cells with the appearance
and distribution of Kupffer cells. In contrast, hepatocytes did not show any significant level of staining, nor did
connective tissue show any staining. The negative controls for the experiment used non-immune rabbit gamma
globulin instead of the specific primary antibodies and consistently showed only background staining. Two normal
biopsies were also reacted with monoclonal antibodies, and reactivities to galectin-3 similar to those obtained with the
polyclonal antibody were observed. The observed pattern of galectin-3 expression in normal human liver is consistent
with that reported for mouse liver (Flotte, et al., Am. J. Pathol., 111:112 (1983)).

The most prominent feature observed in the eight biopsies from subjects having cirrhosis of the liver was the
strong staining of focal regenerating nodules. Variable degrees of staining intensity were seen in different foci of
regenerating liver. Preferential circumferential expression of galectin-3 in the foci was also observed, suggesting
differential regulation. Islands of positively stained cells, and general distribution of other positively staining cells, were
also observed throughout nodules. In all samples of cirrhotic liver, both cytoplasmic and nuclear expression of
galectin-3 were observed. Other regions of hepatic tissue did not display this staining pattern. The ductal epithelial
cells, dendritic cells, and Kupffer cells were also positively stained in the tissue specimens from cirrhotic liver. In the
discussion that follows, we describe an experiment in which we discovered that galectin-3 is expressed in human
hepatocellular carcinoma.

The pattern of galectin-3 expression in human hepatocellular carcinoma was visualized in much the same
manner as described above for normal and cirrhotic liver. The pattern of galectin-3 expression was determined by
using an affinity-purified rabbit anti-galectin-3 antibody as the primary antibody, and positive reactivities were
demonstrated by reddish-brown precipitates (peroxidase-aminoethylcarbazole). In addition, a mouse monoclonal
anti-galectin-3 antibody was used and positive reactivities were observed as brown precipitates
(peroxidase-diaminobenzidine). In both sets of experiments, counterstaining was done with hematoxylin.

The immunohistochemical staining of tumor cells revealed varying degrees of intensity and distribution.
Based on the intensity and distribution of positive staining regions, scores from 1+ to 4+ were assigned: 1+
(scattered positive tumor cells); 2+ (diffuse positive or focally positive); 3+ (large areas of positives, diffuse and
focal); and 4+ (mostly positive and strongly positive). In most of the HCC tissues (76%), positive galectin-3
immunoreactivities of scoring intensity 1+ and greater were observed in the neoplastic cells (see Table I). In each
HCC tissue sample, localization of galectin-3 was predominantly cytoplasmic, but nuclear staining was invariably noted
in many cells and was observed in all samples positive for galectin-3 expression. The neoplastic cells were not
uniformly positive, and tissues from different cases were variable in terms of the fraction of the tumor cells that were
positively stained. In some specimens, however, the neoplastic cells were not stained despite positive staining of the
bile ducts and Kupffer cells. Specificity of the rabbit anti-galectin-3 antibody was verified by immunohistochemistry
using a mouse monoclonal antibody. Two HCC biopsies were stained with both polyclonal and monoclonal antibodies,
and similar staining patterns in tissues expressing the lectin were observed. For most cases of hepatocellular
carcinoma, HBV serology of the patients was available and is included in Table I. Positive HBV infection status is
defined by the presence of serum HBV surface antigen or antibodies to HBV surface antigen. Positive and negative
serological status are indicated as + and -, respectively, with no sign where no information was available (Table I). Of the
25 HCC biopsies studied, 18 were from documented HBV seropositive patients. Seventy-eight percent (14 of 18) of these
biopsies showed positive immunohistochemical reactivities for galectin-3. Seven cases of HCC occurred in HBV
seronegative patients, five of which were positively stained for galectin-3. Thus, galectin-3 expression in hepatocytes is a
general marker for HCC and is not limited to HCC resulting from HBV infection. The HCC samples studied for
galectin-3 expression and the serological status for HBV antigens are summarized in Table I.

Table I

Galectin-3 Immunohistochemical Reactivities in HCC Tissue

Samples classified as HBV positive as shown by serology are indicated in bold type. 

1 Fibrolamellar tumor 

2 Hepatoblastoma 

Summary of HCC Statistics

                      Galectin-3 Positive    Galectin-3 Negative
HBV+                  14                     4
HBV-/undetermined     5                      2

Abbreviations: HBsAg, HBV surface antigen; HBeAg, HBV envelope antigen; a-HBs, antibodies to HBV surface antigen;
a-HBe, antibodies to HBV envelope antigen; a-HBc, antibodies to HBV core antigen.
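
The percentages quoted in the text (78%, 71% and 76%) follow directly from the summary counts above. The short Python sketch below recomputes them and, purely as an illustration that is not part of the original analysis, applies a two-sided Fisher's exact test to the 2x2 summary table. The function names are hypothetical.

```python
from math import comb

# Counts from the Summary of HCC Statistics: (galectin-3 positive, galectin-3 negative)
counts = {"HBV+": (14, 4), "HBV-/undetermined": (5, 2)}


def percent_positive(pos, neg):
    """Percentage of cases staining positive, rounded to the nearest integer."""
    return round(100 * pos / (pos + neg))


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Illustrative addition: the source text reports proportions only.
    """
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d

    def prob(x):
        # Hypergeometric probability of observing x in the top-left cell
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


for group, (pos, neg) in counts.items():
    print(group, percent_positive(pos, neg), "% positive")

total_pos = sum(pos for pos, _ in counts.values())
total_neg = sum(neg for _, neg in counts.values())
print("All HCC", percent_positive(total_pos, total_neg), "% positive")

p_value = fisher_exact_two_sided(14, 4, 5, 2)
print("Fisher exact p-value:", p_value)
```

With these counts the observed table is the single most probable one under the null hypothesis, so the test gives no indication that galectin-3 positivity differs between the two serological groups, in line with the conclusion drawn in the text.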
Another aspect of the present invention relates to methods of determining the pattern of galectin-3
expression in hepatocellular carcinoma cell lines, comprising identifying whether the level of galectin-3 expression in
the cells is above normal. Example 2 describes the characterization of galectin-3 expression in several hepatocellular
carcinoma cell lines.

EXAMPLE 2 

The association of galectin-3 expression with HCC was also confirmed in established HCC cell lines by
immunoblot. As a control, organ extracts from mouse liver, as well as thymus and spleen, were analyzed. The human
hepatoma cell lines Hep 3B (HBsAg+), Hep G2 (Aden, et al., Nature, 282:615 (1979)), HuH-7 (Nakabayashi, et al.,
Cancer Res., 42:3858 (1982)) and PLC/PRF/5 (HBsAg+) (Alexander, et al., S. Afr. J. Med. Sci., 41:89 (1976)) were
kindly provided by Dr. F. Chisari, The Scripps Research Institute, La Jolla, California. The M12.4.5 line was a
generous gift of Dr. Richard Asofsky, National Institutes of Health, Bethesda, Maryland. Cells were cultured in
Dulbecco's Modified Eagle's Medium supplemented with 10% fetal bovine serum and 2 mM glutamine.

Galectin-3 expression in mouse tissues and hepatoma cell lines was detected by immunoblotting, as
described previously (Liu, et al., Am. J. Pathol., 147:1016 (1995)). Briefly, cell lysates were adsorbed with
lactosyl-Sepharose 4B and the bound proteins were eluted and electrophoresed in 12.5% polyacrylamide gel. The separated
proteins were transferred to Immobilon P membranes and galectin-3 was detected by a chemiluminescent system
using a rabbit anti-galectin-3 antibody and alkaline phosphatase-conjugated goat anti-rabbit antibody. Each lane
represents the galectin-3 present in 100 μg protein from tissue extracts and 500 μg protein from each cell lysate.

Protein markers are shown on the left in kDa.

The control immunoblot showed that galectin-3 was expressed in spleen and thymus extracts but not liver.
The immunoblot of the HCC cell lines showed that galectin-3 was expressed in all four HCC cell lines tested. The
identity of the protein was also verified by its lectin activity, as demonstrated by binding to lactosyl-Sepharose 4B,
co-migration with authentic galectin-3 in electrophoresis, and reactivity with specific rabbit antibodies in immunoblot.
Two of the HCC cell lines used, Hep 3B and PLC/PRF/5, are known to contain integrated HBV sequences (Aden, et al.,
Nature, 282:615 (1979); MacNab, et al., British J. Cancer, 34:509 (1976)). However, there is no evidence that the
other cell lines, Hep G2 and HuH-7, are infected with HBV (Aden, et al., Nature, 282:615 (1979); Nakabayashi, et al.,
Cancer Res., 42:3858 (1982)). Galectin-3 expression thus appears to be generally associated with HCC and is not limited
to HCC caused by HBV.

While galectin-3 expression is not limited to HCC resulting from HBV, HBV-X protein induces galectin-3
expression, as described in the example below.

EXAMPLE 3 

The expression of galectin-3 in HBV seropositive individuals with HCC prompted us to test whether the
galectin-3 promoter can be activated by HBV transacting proteins (see Figure 1). Accordingly, the vector
pGalec3-CAT, which contains the murine galectin-3 promoter coupled to a chloramphenicol acetyl transferase (CAT)
reporter gene, was used to co-transfect M12.4.5 cells with a vector expressing the HBV transacting protein HBV-X.
Construction of the vector pGalec3-CAT, consisting of the murine galectin-3 promoter upstream of a bacterial
chloramphenicol acetyl transferase (CAT) gene, was accomplished as described by Hsu, et al., Am. J. Pathol.,
148:1661 (1996). Briefly, a 2 kb region upstream from the transcription initiation site was inserted into a vector
housing a CAT gene. The HBV-X expression vector pARVIMT and parent pMT (McLachlan, et al., J. Virol., 61:683
(1987)), used for co-transfections, were generous gifts of Dr. Alan McLachlan, The Scripps Research Institute, and
contained the murine metallothionein I gene promoter, which is sensitive to and inducible with metal ions. Co-transfections of
pGalec3-CAT and pARVIMT or pMT into murine M12.4.5 cells were accomplished by electroporation, as described by
Hsu, et al., Am. J. Pathol., 148:1661 (1996). Induction of the metallothionein promoter was initiated in some cultures
twenty-four hr after electroporation by addition of ZnCl2 to 10 μM. CAT activities of transfectants were assayed 48
hr after transfection using the phase transfer method with [dichloroacetyl-1,2-14C]-chloramphenicol (New England
Nuclear, Boston, MA) and n-butyryl coenzyme A (Sigma, St. Louis, MO). Corrections were made for basal activity by
subtracting the activity of the parental pCAT-basic vector from all other measurements.

The CAT activity shown in Figure 1 is a measure of the activity of the galectin-3 promoter in the pGalec3-CAT
vector in M12.4.5 host cells. Cotransfections were performed with the HBV-X expression vector pARVIMT or
control pMT. Cotransfections performed with 10 μg or 20 μg pARVIMT in the presence of zinc are indicated by
pARVIMT/10 and pARV1MT/20, respectively, and pARV1MT/20(-Zn) when performed with 20 μg vector in the
absence of zinc. Significant upregulation of the galectin-3 promoter is observed when co-transfected with pARVIMT,
but not with pMT. All values were corrected for basal CAT activity reflective of transfection with the promoter-less
CAT expression vector, and are expressed relative to pGalec3-CAT alone (unity). Data are means of duplicate
transfections shown with error bars indicating SD and are representative of 2 experiments.
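
The normalization described above (subtract the basal activity of the promoter-less vector, then express each value relative to pGalec3-CAT alone, which is set to unity) can be sketched in Python as follows. The function name and all numeric values are hypothetical and chosen only to make the arithmetic visible; they are not data from Figure 1.

```python
from statistics import mean, stdev


def relative_cat_activity(raw, basal, reference):
    """Basal-corrected CAT activity expressed relative to a reference condition.

    raw:       measured activity for the condition of interest
    basal:     activity of the promoter-less vector (pCAT-basic)
    reference: measured activity for pGalec3-CAT alone (defined as unity)
    """
    corrected_reference = reference - basal
    if corrected_reference <= 0:
        raise ValueError("reference activity must exceed basal activity")
    return (raw - basal) / corrected_reference


# Hypothetical counts per minute, for illustration only
basal_cpm = 200.0                    # promoter-less CAT vector
reference_cpm = 1200.0               # pGalec3-CAT alone
duplicates_cpm = [4200.0, 5200.0]    # one co-transfection condition, in duplicate

relative_values = [
    relative_cat_activity(x, basal_cpm, reference_cpm) for x in duplicates_cpm
]
fold_mean = mean(relative_values)    # mean of the duplicate transfections
fold_sd = stdev(relative_values)     # sample SD, as would be drawn as an error bar

print(f"relative CAT activity: {fold_mean:.2f} +/- {fold_sd:.2f}")
```

By construction the reference condition evaluates to exactly 1, so every other bar can be read directly as fold activation of the promoter.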

As shown in Figure 1, co-transfection of the vector pGalec3-CAT and the vector expressing HBV-X protein
resulted in upregulation of the galectin-3 promoter, as evident from increased CAT activity. The vector expressing
HBV-X protein contains a metallothionein gene promoter and is sensitive to and inducible by metal ions. In the
presence of Zn2+, however, suppression of CAT activity was observed, indicating that activation of the murine
metallothionein I promoter driving expression of the HBV-X protein had already been achieved.

The results presented above show that many normal hepatocytes, which do not express galectin-3, are
converted to galectin-3 expressing cells upon neoplastic transformation. Of the HCC biopsies evaluated, 19 of 25
(76%) were immunohistochemically positive for galectin-3. While not all neoplastic cells in HCC express this lectin
and the cases of HCC studied fall into a spectrum in terms of the proportion of tumor cells that are positively stained
for galectin-3, galectin-3 expression is associated with HCC and cirrhosis of the liver. The presence of galectin-3 in
HCC cell lines indicates that upregulation of this lectin occurs in transformed hepatocytes.
Analysis of galectin-3 expression in HCC from patients with or without serological evidence of HBV infection
revealed that galectin-3 expression in HCC is not strictly correlated with HBV infection (positive galectin-3 expression
in 78% of HBV+ patients and 71% of HBV- patients). In addition, since at least one of the cell lines shown to express
galectin-3 does not appear to exhibit evidence of HBV infection, the data suggest that factors other than HBV
infection can result in galectin-3 expression. Additional evidence indicating non-viral upregulation of galectin-3
expression in hepatic tissue exists in the observation that fibrolamellar tumors also express galectin-3. These tumors
generally occur in young patients, are not associated with chronic infection with HBV, and do not usually appear in
cirrhotic liver (Berman, et al., Cancer, 46:1448 (1980); Craig, et al., Cancer, 46:372 (1980)).

The following discussion is offered only to provide possible explanations for the relationship of expression of
galectin-3 to neoplasia. These explanations are not intended to limit any aspect of the present invention nor are they
intended to be used to construe, interpret, or restrict the scope of protection provided by the claims and/or the scope
of equivalent compositions and methods.

Results from our co-transfection experiments show that the upregulation of galectin-3 in HBV infected and
transformed cells can result from transactivation of the galectin-3 promoter by the HBV-X protein. No direct
sequence-specific interaction of this protein occurs with DNA (Wu, et al., Cell, 63:687 (1990)). The effects of HBV-X,
however, can be mediated by interactions with transcription factors CREB and ATF2 (Maguire, et al., Science,
252:842 (1991)). In addition, HBV-X can transactivate DNA elements regulating viral and cellular genes such as the
κB sequence bound by the transcription factor NF-κB (Twu, et al., Proc. Natl. Acad. Sci. USA, 86:5168 (1989)), the
c-myc gene promoter (Koike, et al., Molec. Biol. Med., 6:151 (1989)) and binding sites for AP1 and AP2 transcription
factors (Seto, et al., Nature, 344:72 (1990)). Indeed, several potential DNA sites for interaction with NF-κB, AP1 and
AP2 were identified within 1 kb upstream of the transcription initiation site. The transactivation of the galectin-3
gene by HBV-X is, therefore, consistent with our data demonstrating upregulation of this promoter by HTLV-I Tax
(Hsu, et al., Am. J. Pathol., 148:1661 (1996)), a potent viral transactivator through the CREB/ATF transcriptional factor
pathway. Furthermore, upregulated expression of galectin-3 was also demonstrated in 3T3 cells following
transformation with the Kirsten murine sarcoma virus (Crittenden, et al., Mol. Cell. Biol., 4:1252 (1984)). Additionally,
analysis of fibroblasts from mouse and chicken for galectin-3 expression showed that infection with viruses can result
in upregulated expression of the lectin (Crittenden, et al., Mol. Cell. Biol., 4:1252 (1984)). In sum, we believe that
galectin-3 expression may be affected by many viruses.

The finding that regenerating nodules in cirrhotic liver express galectin-3 is also worthy of comment. Since
these hepatocytes are undergoing rapid proliferation, we believe that this proliferative cell status is linked to
galectin-3 expression. Previously, others reported that elevated levels of this lectin are found in proliferating fibroblasts
(Moutsatsos, et al., Proc. Natl. Acad. Sci. USA, 84:6452 (1987)). An alternative but not mutually exclusive explanation
may be the following. Because oval cells expressing markers for both bile duct epithelia and hepatocytes (Gerber, et
al., Am. J. Pathol., 110:70 (1983)) are hypothesized to function as hepatic stem cells (Fausto, N. and Thompson, N.L.,
Purification and culture of oval cells from rat liver. In: Pretlow II, T.G. and T.P. Pretlow (eds.), Cell Separation:
Methods and Selected Applications, pp. 45-96, Academic Press, Orlando (1986)) and since galectin-3 is expressed in
cells of ductal origin, presence of this protein in these nodules may be residual lectin resulting from the production of
hepatocytes from these stem cells. In either event, we believe that regulation of galectin-3 expression during
differentiation of hepatocytes relates to the fate of the cells, and deregulation, as a result of viral influences,
interferes with this process. Further, we contemplate that cells positively stained for galectin-3 are in the process of
becoming cancer cells and that expression of galectin-3 represents an early event leading to tumor transformation.
Alternatively, expression of galectin-3 in cirrhotic cells may indicate conversion to normalcy.

In the following discussion, we provide several diagnostic and prognostic tools for use with embodiments of
the present invention.

Diagnostic tools 

While normal hepatocytes do not express galectin-3, expression of this lectin is associated with human
hepatocellular carcinoma (HCC). Lectin expression in HCC is positively influenced by hepatitis B virus (HBV) infection
through a mechanism that includes transactivation of the galectin-3 gene promoter. Other areas in the liver found to
express galectin-3 are focal regenerating nodules of cirrhotic tissue. We contemplate that deregulated expression of
galectin-3 results in transformation, conversion of tumors to increased invasiveness, and confers preferential survival
of tumors. Accordingly, several diagnostic and prognostic tools which detect expression of galectins in the liver may
be used to determine whether an individual is suffering from HCC or cirrhosis of the liver or is likely to suffer from
HCC or cirrhosis of the liver in the future.
Generally, the diagnostics and methods of use thereof can be classified according to whether the diagnostic
detects the expression of galectin-3 protein in a biological sample (e.g., a sample having hepatocytes) or the expression
of galectin-3 RNA in a biological sample. Accordingly, the level of expression of galectin-3 protein or RNA in a biological
sample indicates a predilection to liver disease (e.g., cirrhosis or HCC). Once a biological sample from a subject in need
of testing is obtained, many different techniques can be used to detect the level of galectin-3 protein and/or RNA
expression including, but not limited to, antibody-based detection techniques, bacteriophage display techniques,
lectin-binding techniques, hybridization techniques, and enzymatic digestion (e.g., RNAse protection) techniques.
Additionally, we contemplate the use of physical detection methods including, but not limited to, absorption, emission,
or resonance spectra, and the identification of galectins by nucleic acid or protein fragmentation patterns (e.g.,
enzymatic or chemical cleavage patterns). Some of these techniques involve disposing the proteins and/or nucleic
acids present in the biological sample on a support, and contacting the support with detection components such as
antibodies to galectin-3 or nucleic acid probes complementary to galectin-3 mRNA. Desirably, the levels of expression
of galectin-3 protein and/or RNA from diseased and normal individuals are compared to the level detected in the subject
tested.
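
The comparison described above, in which the level measured in the tested subject is set against levels from normal individuals, can be sketched in Python. The cutoff rule (mean plus two standard deviations of the normal reference values), the function name, and the numbers are all assumptions made for illustration; the text does not specify a decision threshold or measurement units.

```python
from statistics import mean, stdev


def is_above_normal(subject_level, normal_levels, n_sd=2.0):
    """Return True if subject_level exceeds mean + n_sd * SD of the normal values.

    The mean + n_sd * SD cutoff is an illustrative assumption, not a rule
    taken from the text.
    """
    if len(normal_levels) < 2:
        raise ValueError("need at least two normal reference values")
    cutoff = mean(normal_levels) + n_sd * stdev(normal_levels)
    return subject_level > cutoff


# Hypothetical galectin-3 signal in arbitrary units from normal liver samples
normal_reference = [1.0, 1.2, 0.8, 1.1, 0.9]

print(is_above_normal(3.5, normal_reference))  # well above the reference range
print(is_above_normal(1.1, normal_reference))  # within the reference range
```

In practice the reference set, the measurement method (protein or RNA), and the cutoff would all need to be established empirically for the assay in use.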

Additionally, we contemplate the preparation of diagnostic kits comprising detection components such as
antibodies specific for the galectin-3 protein or nucleic acid probes for detecting RNA encoding galectin-3. The
detection component will typically be supplied in combination with one or more of the following reagents. A support
capable of absorbing or otherwise binding RNA or protein will often be supplied. Available supports for this purpose
include, but are not limited to, membranes of nitrocellulose, nylon or derivatized nylon that may be characterized by
bearing an array of positively charged substituents. One or more enzymes, such as Reverse Transcriptase and/or Taq
polymerase, may be furnished in the kit, as may dNTPs, buffers, or non-human polynucleotides like calf-thymus or
salmon-sperm DNA. Results from the kit assays can be interpreted by a healthcare provider or a diagnostic
laboratory. Alternatively, diagnostic kits are manufactured and sold to private individuals for self-diagnosis.

In many aspects of the present invention, galectins or RNA encoding galectins obtained from a biological
sample are disposed on a support so that the level of galectin expression can be rapidly determined. The term
"matrix" or "support" refers to a carrier, a resin or any macromolecular structure used to attach, immobilize, or
dispose thereon a biomolecule such as RNA or protein. Solid supports include, but are not limited to, the walls of
wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, nitrocellulose strips, membranes,
microparticles such as latex particles, sheep (or other animal) red blood cells, duracytes® and others. Additionally,
organic carriers including proteins and oligo/polysaccharides (e.g., cellulose, starch, glycogen, chitosan or aminated
Sepharose) and inorganic carriers such as silicon oxide material (e.g., silica gel, zeolite, diatomaceous earth or aminated
glass) are contemplated. Furthermore, in some embodiments, a liposome or lipid bilayer (natural or synthetic) is
contemplated as a support. Desirable supports also include polyacrylamide gels, agarose gels, composite gels, and
other gel matrices, papers, chips, membranes, chromatography matrices, as used in thin layer chromatography, and
resins or beads, as used in affinity chromatography.
The support may have a hydrophobic surface which interacts with a portion of the biomolecule by
hydrophobic non-covalent interaction. For example, the hydrophobic surface of the support may be a polymer such as
plastic or any other polymer in which hydrophobic groups have been linked such as polystyrene, polyethylene or
polyvinyl. The support may also have a charged surface which interacts with the biomolecule, such as a charged
nitrocellulose or nylon membrane. The supports may have other reactive groups which can be chemically activated so
as to attach a biomolecule. For example, cyanogen bromide activated matrices, epoxy activated matrices, thio and
thiopropyl gels, nitrophenyl chloroformate and N-hydroxysuccinimide chloroformate linkages, and oxirane acrylic
supports are common in the art (Sigma).

Any biomolecule which interacts with a galectin or a nucleic acid sequence complementary to a nucleic acid 
encoding a galectin (e.g., antibody, phage, β-galactoside sugar, other galectin ligand, or DNA or RNA), may be
attached in overlapping areas or at random locations on the solid support and can be used to probe for the presence of
a galectin or RNA encoding a galectin in a biological sample. Additionally, biomolecules which interact with galectins
or nucleic acid sequences encoding galectins may be attached in an ordered array wherein each biomolecule is
attached to a distinct region of the solid support which does not overlap with the attachment site of any other
biomolecule. Preferably, such an ordered array of biomolecules is designed to be "addressable" where the distinct
locations are recorded and can be accessed as part of an assay procedure. Addressable biomolecule arrays typically
comprise a plurality of different biomolecule probes that are coupled to a surface of a substrate in different known
locations. The knowledge of the precise location of each biomolecule makes these "addressable" arrays
particularly useful in binding assays. 
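
By way of illustration only, the notion of an addressable array, in which each probe occupies one recorded, non-overlapping location, might be sketched as follows. The class, field, and probe names are hypothetical and are not part of any array technology referenced herein.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Probe:
    name: str    # hypothetical probe label
    target: str  # analyte the probe is meant to capture


class AddressableArray:
    """Ordered array in which every probe sits at one known address."""

    def __init__(self, rows, cols):
        self.rows, self.cols = rows, cols
        self._sites = {}  # (row, col) -> Probe

    def attach(self, row, col, probe):
        # Reject addresses off the support and addresses already in use,
        # so that attachment sites never overlap.
        if not (0 <= row < self.rows and 0 <= col < self.cols):
            raise IndexError("address outside the support")
        if (row, col) in self._sites:
            raise ValueError("address already occupied")
        self._sites[(row, col)] = probe

    def probe_at(self, row, col):
        return self._sites[(row, col)]

    def interpret(self, signals):
        """Translate signals keyed by address into signals keyed by target."""
        return {self._sites[addr].target: value for addr, value in signals.items()}


array = AddressableArray(rows=2, cols=2)
array.attach(0, 0, Probe("anti-galectin-3 antibody", "galectin-3"))
array.attach(0, 1, Probe("galectin-1 probe", "galectin-1"))
readout = array.interpret({(0, 0): 0.82, (0, 1): 0.10})
```

Because the address of each probe is recorded, a signal measured at a given location can be assigned to its analyte without further identification steps.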

Any addressable array technology known in the art can be employed with this aspect of the invention. One
particular embodiment of polynucleotide arrays is known as the Genechips™, and has been generally described in US
Patent 5,143,854 and PCT publications WO 90/15070 and WO 92/10092. These arrays may generally be produced using
mechanical synthesis methods or light directed synthesis methods, which incorporate a combination of
photolithographic methods and solid phase oligonucleotide synthesis (Fodor et al., Science 251:767-777, 1991). The
immobilization of arrays of oligonucleotides on solid supports has been rendered possible by the development of a
technology generally identified as "Very Large Scale Immobilized Polymer Synthesis" (VLSIPS™) in which, typically,
probes are immobilized in a high density array on a solid surface of a chip. Examples of VLSIPS™ technologies are
provided in US Patents 5,143,854 and 5,412,087 and in PCT Publications WO 90/15070, WO 92/10092 and WO
95/11995, which describe methods for forming oligonucleotide arrays through techniques such as light-directed
synthesis. In designing strategies aimed at providing arrays of nucleotides immobilized on solid supports,
further presentation strategies were developed to order and display the oligonucleotide arrays on the chips in an
attempt to maximize hybridization patterns and sequence information. Examples of such presentation strategies are
disclosed in PCT Publications WO 94/12305, WO 94/11530, WO 97/29212 and WO 97/31256.

In the following discussion, we provide methods and compositions which are used to detect the presence of 
galectin-3 protein in a biological sample so that the predisposition to HCC can be determined.

Detection of galectin-3 expression using protein-based diagnostics

By one approach, galectin-3 expression is determined by using protein-based diagnostics. The presence and
amount of galectin-3 protein in a biological sample can be detected by screening for the presence of the protein using
conventional assays. Antibodies (monoclonal or polyclonal) immunoreactive with galectin-3 protein can be used to
screen biological samples such as serum, urine, ascites, cerebrospinal fluids or any other body fluid. Preferably, liver
cells, extracts of liver cells, or liver biopsy tissue, as described above, are screened for the presence and amount of
galectin-3. Portions, fragments, or derivatives of antibodies are contemplated for use with several embodiments of the
present invention. Further, phage display or galectin-3 binding assays can be used to detect the presence and amount
of galectin-3 in a biological sample. Such immunological assays can be done in many convenient formats and are
known to those of skill in the art.
In some embodiments, antibodies reactive to galectin-3 are used to immunoprecipitate galectin-3 protein
from a liver cell homogenate. In others, Western blots or immunoblots of proteins isolated from liver cells are performed
using anti-galectin-3 antibodies. In another preferred embodiment, antibodies detect galectin-3 in paraffin or
frozen sections of liver tissue, using immunocytochemical techniques. Preferred embodiments relating to methods for
detecting the presence and amount of galectin-3 protein also include enzyme-linked immunosorbent assays (ELISA),
radioimmunoassays (RIA), immunoradiometric assays (IRMA), fluorescence-activated cell sorting (FACS), and
immunoenzymatic assays (IEMA), including sandwich assays using monoclonal and/or polyclonal antibodies.
Exemplary sandwich assays are described by David et al., in U.S. Patent Nos. 4,376,110 and 4,486,530.

The presence of galectin-3 in a sample may also be determined using affinity chromatography columns
having a β-galactoside bound thereto. The proteins present in a biological sample are applied to a support having a
β-galactoside sugar (e.g., a lactosyl-SEPHAROSE resin) under conditions which allow galectin-3 to bind to the support.
The presence and amount of galectin-3 can be determined directly using antibodies or phage display, or the bound
galectin-3 can be eluted by using a β-galactoside sugar (e.g., 0.1 M lactose). Many more methods for assessing the
ability of galectin-3 protein to bind a β-galactoside sugar are known in the art.

In order to provide a basis for diagnosis, normal or standard values for galectin-3 expression of a subject are
preferably established. This can be accomplished in many ways and one approach is to contact body fluids, cell
extracts, or biopsy tissue from normal subjects with antibody to galectin-3 under conditions suitable for complex
formation. The amount of antibody bound to normal fluids, cells, and tissues (e.g., liver cells and liver tissue) is
quantified and recorded using conventional immunological techniques. This measurement can serve as a baseline to
which biological samples from subjects having HCC or cirrhosis of the liver or likely to develop HCC or cirrhosis of the
liver can be compared. Additionally, databases having measurements of galectin-3 expression of several afflicted
individuals (similar to that shown in Table 1) are valuable standards by which the progression of liver disease, based
on galectin-3 expression, can be monitored. In this manner, deviation between the standard and the subject values
establishes the presence and severity of the disease state.
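
The comparison of a subject value against a baseline established from normal subjects might, purely as an illustration, be expressed as a deviation in units of the normal standard deviation. The function names, the example measurements, and the cutoff of two standard deviations are assumptions made for this sketch; no particular cutoff is specified herein.

```python
from statistics import mean, stdev


def baseline(normal_values):
    """Mean and sample standard deviation of measurements from normal subjects."""
    return mean(normal_values), stdev(normal_values)


def deviation_from_baseline(subject_value, normal_values):
    """Deviation of the subject value from the normal mean, in standard deviations."""
    mu, sd = baseline(normal_values)
    return (subject_value - mu) / sd


def is_elevated(subject_value, normal_values, cutoff_sd=2.0):
    # cutoff_sd is an illustrative threshold, not a value taken from the text.
    return deviation_from_baseline(subject_value, normal_values) > cutoff_sd


# Hypothetical antibody-binding measurements in arbitrary units.
normal = [1.0, 1.2, 0.8, 1.1, 0.9]
flag_high = is_elevated(2.0, normal)
flag_normal = is_elevated(1.1, normal)
```

A larger deviation corresponds to a greater departure from the normal standard, which is the quantity that the comparison described above relies upon.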

In the following discussion, we provide methods and compositions which are used to detect the presence of 
RNA encoding galectin-3 in a biological sample so that the presence of or predisposition to HCC and cirrhosis can be
determined. 

Detection of galectin-3 expression using RNA-detection diagnostics 

A wide variety of labels and conjugation techniques are known by those skilled in the art and can be used in 
various nucleic acid assays. There are several ways to produce labeled nucleic acids for hybridization or PCR 
(Polymerase Chain Reaction) including, but not limited to, oligolabeling, nick translation, end-labeling, or PCR
amplification using a labeled nucleotide. Alternatively, a desired galectin sequence, or any portion of it, may be cloned
into a vector for the production of an mRNA probe. Such vectors are known in the art, are commercially available, and
may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3 or SP6
and labeled nucleotides. A number of companies such as Pharmacia Biotech (Piscataway N.J.), Promega (Madison
Wis.), and U.S. Biochemical Corp (Cleveland Ohio) supply commercial kits and protocols for these procedures. Suitable
reporter molecules or labels include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic
agents as well as substrates, cofactors, inhibitors, magnetic particles and the like.

For diagnostic and prognostic purposes, nucleic acid probes having a sequence complementary to a nucleic 
acid encoding a galectin may be used to detect and quantitate gene expression of the galectin in biological samples 
including, but not limited to, biopsied tissues or any of the biological samples discussed above. Preferably, nucleic acid
probes which are complementary to mRNA encoding galectin-3 are used to screen for polynucleotides in liver biopsy
tissue, liver cell extracts, or other biological samples. RNA-detection-based diagnostic assays, such as Northern
hybridization, Northern dot blots, RNA in situ hybridization, and ELISA assays, are particularly useful to distinguish
between the absence, presence, and excess expression of a galectin and to monitor the regulation of
galectin levels during therapeutic intervention.

Included in the scope of the invention is the use of oligonucleotide sequences, antisense RNA and DNA
molecules and PNAs which complement galectin sequences, particularly galectin-3, for the determination of galectin
expression in liver cells and tissue by RNA-based detection techniques. These forms of polynucleotide sequences
encoding galectin-3 may also be used for the diagnosis of conditions or diseases with which the expression of
galectin-3 is associated. For example, polynucleotide sequences complementary to mRNA encoding galectin-3 may be used in
hybridization or PCR assays of fluids or tissues from biopsies to detect galectin-3 expression and thereby identify the
subject as having a predisposition to or an established case of cirrhosis of the liver and/or HCC. The form of such
qualitative and/or quantitative methods may include northern analysis, dot blot or other membrane-based technologies;
PCR technologies; dip stick, pin, chip and ELISA technologies. All of these techniques are well known in the art and
are the basis of many commercially available diagnostic kits.

In one aspect, RNA probes complementary to galectin-3 mRNA are used in assays that detect activation or
induction associated with disease (including cirrhosis and HCC). Accordingly, the nucleotide or polypeptide sequences
of galectin-3 (referenced in Robertson et al., Biochemistry 29: 8093-8100 (1990) or Genbank Accession J02921,
M57710, 4504982, or NID g4504982, g179530, NM_002306.1 and NP_002297), the disclosures of which are
incorporated herein by reference in their entireties, are used to design suitable RNA probes. (See Figure 2). The RNA
probes are labeled by methods known in the art and are added to a DNAse treated fluid or tissue sample from a
patient under conditions suitable for the formation of hybridization complexes. Hybridization complexes are
isolated or the sample is treated with an agent which removes unhybridized nucleic acids. After an incubation period,
the sample is washed with a compatible fluid which optionally contains a dye (or other label requiring a developer) if
the nucleotide has been labeled with an enzyme. After the compatible fluid is rinsed off, the dye is quantitated and
compared with a standard. If the amount of dye in the biopsied or extracted sample is significantly elevated over that
of a comparable control sample, the nucleotide sequence has hybridized with RNA in the sample, and the presence of
elevated levels of RNA encoding galectin-3 in the sample indicates the presence of liver disease.

Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment regime in animal 
studies, in clinical trials, or in monitoring the treatment of an individual patient. In order to provide a basis for the 
diagnosis of disease, a normal or standard profile for human galectin-3 expression in liver cells, extracts or tissue must 
be established. This is accomplished by combining body fluids or cell extracts taken from normal subjects with RNA 
probes encoding galectin-3, or a portion thereof, under conditions suitable for hybridization. Standard hybridization
may be quantified by comparing the values obtained for normal subjects with a dilution series of galectin-3 RNA run in
the same experiment where a known amount of substantially purified galectin-3 RNA is used. Standard values
obtained from normal samples are then compared with values obtained from samples from subjects thought to be
afflicted with liver disease. Deviation between standard and subject values establishes the presence of disease. 

Additionally, PCR methods which may be used to quantitate the expression of a particular molecule include
radiolabeling (Melby P. C. et al. J Immunol Methods 159:235-44 (1993)) or biotinylating nucleotides (Duplaa C. et al.
Anal Biochem 212:229-236 (1993)), coamplification of a control nucleic acid, and standard curves onto which the
experimental results are interpolated. Quantitation of multiple samples may be speeded up by running the assay in an
ELISA format where the oligomer of interest is presented in various dilutions and a spectrophotometric or colorimetric
response gives rapid quantitation. A definitive diagnosis of this type may allow health professionals to begin
aggressive treatment and prevent further worsening of the condition. Similarly, further assays can be used to monitor
the progress of a patient during treatment.
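
The interpolation of an experimental result onto a standard curve, as mentioned above, might be sketched as follows. The sketch assumes a dilution series whose signal rises monotonically with the known amount and uses simple piecewise-linear interpolation; the function name, the units, and the example values are hypothetical.

```python
def interpolate_amount(signal, standards):
    """Estimate an unknown amount from its signal using a dilution series.

    standards is a list of (known_amount, measured_signal) pairs; the signal
    is assumed to increase monotonically with the amount.
    """
    points = sorted(standards, key=lambda p: p[1])
    if not points[0][1] <= signal <= points[-1][1]:
        raise ValueError("signal lies outside the standard curve")
    for (a0, s0), (a1, s1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:
                return a0
            # Linear interpolation between the two flanking standards.
            return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)


# Hypothetical standards: (amount in ng, absorbance).
curve = [(0.0, 0.05), (1.0, 0.25), (2.0, 0.45), (4.0, 0.85)]
estimate = interpolate_amount(0.35, curve)
```

Signals outside the range of the standards are rejected rather than extrapolated, since the curve is only characterized between its lowest and highest dilutions.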

Once disease is established, a therapeutic agent is administered and a treatment profile is generated. Such 
assays may be repeated on a regular basis to evaluate whether the values in the profile progress toward or return to 
the normal or standard pattern. Successive treatment profiles may be used to show the efficacy of treatment over a
period of several days or several months.

As mentioned above, PCR technology may be used to identify and quantitate expression of galectins, 
particularly galectin-3. For amplification of mRNAs, it is within the scope of the present invention to reverse
transcribe mRNA into cDNA followed by PCR (RT-PCR); or, to use a single enzyme for both steps as described in U.S.
Patent No. 5,322,770, the disclosure of which is incorporated herein by reference in its entirety; or, to use Reverse
Transcriptase Asymmetric Gap Ligase Chain Reaction (RT-AGLCR), as described by Marshall R.L. et al. (PCR Methods
and Applications 4:80-84, 1994), the disclosure of which is incorporated herein by reference in its entirety.

A variety of PCR techniques are familiar to those skilled in the art. For a review of PCR technology, see 
Molecular Cloning to Genetic Engineering, White, B.A., Ed., in Methods in Molecular Biology 67, Humana Press, Totowa
(1997), the disclosure of which is incorporated herein by reference in its entirety, and the publication entitled "PCR
Methods and Applications" (1991, Cold Spring Harbor Laboratory Press), the disclosure of which is incorporated
herein by reference in its entirety. In each of these PCR procedures, PCR primers on either side of the galectin-3
sequence to be amplified are added to a suitably prepared nucleic acid sample along with dNTPs and a thermostable
polymerase such as Taq polymerase, Pfu polymerase, or Vent polymerase. The nucleic acid in the sample is denatured and
the PCR primers are specifically hybridized to complementary nucleic acid sequences in the sample. The hybridized primers
are extended. Thereafter, another cycle of denaturation, hybridization, and extension is initiated. The cycles are repeated
multiple times to produce an amplified fragment containing the nucleic acid sequence between the primer sites. PCR has
further been described in several patents including US Patents 4,683,195, 4,683,202 and 4,965,188, the disclosures
of which are incorporated herein by reference in their entireties.

The primers are selected to be substantially complementary to a portion of the sequence of galectin-3
mRNA and a portion of the sequence complementary to the sequence of galectin-3 mRNA, thereby allowing the
sequences between the primers to be amplified. The length of the primers for use with aspects of the present
invention can range from 8 to 100 nucleotides, preferably from 8 to 50, 8 to 30 or, more preferably, 8 to 25
nucleotides. Shorter primers tend to lack specificity for a target nucleic acid sequence and generally require cooler
temperatures to form sufficiently stable hybrid complexes with the template. Longer primers are expensive to produce
and can sometimes self-hybridize to form hairpin structures. The formation of stable hybrids depends on the melting
temperature (Tm) of the DNA. The Tm depends on the length of the primer, the ionic strength of the solution and the
G+C content. The higher the G+C content of the primer, the higher is the melting temperature because G:C pairs are
held by three H bonds whereas A:T pairs have only two. The G+C content of the amplification primers of the present
invention preferably ranges between 10 and 75 %, more preferably between 35 and 60 %, and most preferably
between 40 and 55 %. The appropriate length for primers under a particular set of assay conditions may be
empirically determined by one of skill in the art.
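
The dependence of Tm on primer length and G+C content can be illustrated with a short calculation. No Tm formula is given herein; the sketch below uses the commonly cited Wallace rule, Tm ≈ 2(A+T) + 4(G+C) °C, which is a rough estimate for short oligonucleotides and ignores ionic strength. The function names and the example primer are hypothetical, and the primer is not a galectin-3 sequence.

```python
def gc_percent(primer):
    """Percentage of G and C bases in the primer."""
    seq = primer.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(primer):
    """Rough melting temperature in degrees C by the Wallace rule.

    Assumption: 2 degrees per A or T and 4 degrees per G or C, a
    rule of thumb that does not account for ionic strength.
    """
    seq = primer.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def classify_gc(primer):
    """Place the primer in the G+C ranges stated in the text above."""
    gc = gc_percent(primer)
    if 40 <= gc <= 55:
        return "most preferred"
    if 35 <= gc <= 60:
        return "more preferred"
    if 10 <= gc <= 75:
        return "preferred"
    return "outside stated range"


primer = "AGCTTGCCATGGTACCGATC"  # arbitrary 20-mer for illustration
summary = (len(primer), gc_percent(primer), wallace_tm(primer), classify_gc(primer))
```

Each G or C contributes twice as much to the estimate as each A or T, which reflects the three hydrogen bonds of a G:C pair against the two of an A:T pair.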

The spacing of the primers determines the length of the segment to be amplified. In the context of the
present invention amplified segments carrying nucleic acid sequence encoding a galectin can range in size from at least
about 25 bp to 35 kbp. Amplification fragments from 25-3000 bp are typical, fragments from 50-1000 bp are
preferred and fragments from 100-600 bp are highly preferred. It will be appreciated that amplification primers for a
specific galectin may be any sequence which allows the specific amplification of any DNA fragment carrying nucleic
acid sequence unique to the particular galectin. Amplification primers may be labeled or immobilized on a solid support
as described above.
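
As a simple worked illustration of how primer spacing fixes the amplified length, the sketch below computes the fragment size from the template positions of the two primers and sorts it into the size ranges stated above. The coordinate convention (1-based, inclusive, both positions given on the same strand) and the function names are assumptions of this sketch.

```python
def amplicon_length(forward_start, reverse_end):
    """Length in bp of the fragment spanning both primers.

    forward_start: template position of the first base of the forward primer.
    reverse_end: template position of the last base covered by the reverse primer.
    Both are 1-based positions on the same strand (an assumed convention).
    """
    if reverse_end < forward_start:
        raise ValueError("reverse primer must lie downstream of forward primer")
    return reverse_end - forward_start + 1


def size_class(length_bp):
    """Sort a fragment length into the ranges given in the text above."""
    if 100 <= length_bp <= 600:
        return "highly preferred"
    if 50 <= length_bp <= 1000:
        return "preferred"
    if 25 <= length_bp <= 3000:
        return "typical"
    if 25 <= length_bp <= 35000:
        return "within stated range"
    return "outside stated range"


length = amplicon_length(forward_start=101, reverse_end=500)
category = size_class(length)
```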

In the following discussion, we provide methods and compositions which are used to detect patterns of 
galectin expression in a biological sample so that the predisposition to disease can be determined.

Detection of patterns of galectin expression to identify predisposition to disease

Another aspect of the present invention is a method of correlating the ratio between the expression levels of
a plurality of galectins with a disease state comprising obtaining biological samples from individuals suffering from the
disease and normal individuals, determining the expression levels of two or more galectins in the samples, and
determining whether there is a statistically significant association between the ratio of galectin expression and the
disease state. Statistically significant associations can be determined using statistical methods familiar to those
skilled in the art, including chi-squared analyses. In some embodiments, the disease state is cancer. Galectin
expression may be measured using any of the methods described above for detecting galectin-3 protein or mRNA.
Preferably, the galectins whose expression levels are measured to determine the ratios are selected from the group
consisting of galectins 1-10. Alternatively, the galectins may be galectins which are currently unknown but which are
identified based on their possession of one or more of the homology regions presented in Table III. Table III shows
examples of conserved galectin residues from members of different animal species; the conserved residues are boxed.
Similar patterns of amino acid conservation in context of each other can be used to differentiate galectin homologs in
one animal species. Preferably, galectin expression is measured using the protein-based or nucleic acid based
detection methods described above for galectin-3.
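
A chi-squared analysis of the kind mentioned above might be sketched as follows for the simplest case, a 2 x 2 table of diseased versus normal individuals against a high versus low galectin ratio. The dichotomization into "high" and "low" ratios, the example counts, and the function name are assumptions of this sketch; the critical value 3.841 is the standard chi-squared threshold for one degree of freedom at the 0.05 level.

```python
def chi_squared_2x2(table):
    """Pearson chi-squared statistic for a 2 x 2 contingency table.

    table = [[a, b], [c, d]]; all row and column totals are assumed non-zero.
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    return statistic


CRITICAL_VALUE_DF1_P05 = 3.841  # chi-squared, 1 degree of freedom, alpha = 0.05

# Hypothetical counts: rows are diseased / normal, columns are high / low ratio.
counts = [[18, 2], [5, 15]]
statistic = chi_squared_2x2(counts)
significant = statistic > CRITICAL_VALUE_DF1_P05
```

With small expected counts an exact test would ordinarily be preferred; the sketch is intended only to show the form of the calculation.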

The overexpression, underexpression, or, sometimes, any expression of members of the galectin family is
associated with several disease states and most acutely cancer. Galectin-1 and galectin-3 expression, for example,
has been associated with tumor metastasis. (Raz et al., Cancer Res. 46:3667 (1986) and Raz et al., Int. J. Cancer
46:871 (1990)). Galectin-5 is thought to mediate cell adhesion and galectin-5H is believed to be involved with the
onset of inflammatory disease and several forms of neoplastic disease. (Hillman et al., U.S. Pat. No. 5,837,493). The
expression of galectin-7 has been observed to be downregulated in a malignant keratinocyte cell line and is thought to
be required for the maintenance of normal keratinocytes. (Hillman et al., U.S. Pat. No. 5,837,493). The appearance
of galectins 8 and 9 has also been associated with the onset of cancer. In the table below, several tumors that have
been associated with the expression of various galectins are listed.



Table II

Association of Galectins with Neoplasms

Tumor Type                       Galectin   Normal Tissue Expression            Properties
Bladder carcinoma                1          Expressed                           Upregulated in high-grade tumors
Anaplastic large cell lymphoma   3          Absent in T lymphocytes             Tumor marker
Colorectal carcinoma             3          Expressed                           Downregulated in adenocarcinoma
Colorectal carcinoma             3          Nuclear and cytoplasmic             Conversion from nuclear to cytoplasmic with tumor progression
Ovarian carcinoma                3          Expressed                           Downregulated in carcinoma
Breast carcinoma                 3          Expressed                           Downregulated in carcinoma
Thyroid carcinoma                3          Expressed                           Tumor marker
Colorectal carcinoma             4          Expressed                           Downregulated
Various                          7          Expressed in stratified epithelia   Downregulation antigen; metaplasiogenic marker
Prostate carcinoma               8          Absent                              Tumor marker
Hodgkin's lymphoma               9          Absent                              Tumor marker

Because the family of galectins share several conserved domains, the elucidation of many more galectin
family members will be forthcoming. (See Table III). Many more tumor markers and/or markers for other forms of
disease will be found.

TABLE III

                         1                                                   50
                 cegal   ---------- ---------- ---------- ---------- ----------
                 eelgal  ---------- ---------- ---------- ---------- ----------
(Seq. ID No. 1)  chgal3  MQAMKARCWQ PHWMLPLLPL SSPLHPQLSD ALPAHNPGAP PPQGWNRPPG
                 ragal4  ---------- ---------- ---------- ---------- ----------
                 mugal4  ---------- ---------- ---------- ---------- ----------
                 hugal1  ---------- ---------- ---------- ---------- ----------
                 hugal4  ---------- ---------- ---------- ---------- ----------

                         51                                                 100
                 cegal   ---------- ---------- ---------- ---------- ----------
                 eelgal  ---------- ---------- ---------- ---------- ----------
(Seq. ID No. 2)  chgal3  PGAFPAYPGY PGAYPGAPGP YPGAPGPHHG PPGPYPGGPP GPYPGGPPGP
                 ragal4  ---------- ---------- ---------- ---------- ----------
                 mugal4  ---------- ---------- ---------- ---------- ----------
                 hugal1  ---------- ---------- ---------- ---------- ----------
                 hugal4  ---------- ---------- ---------- ---------- ----------

                         101                                                150
(Seq. ID No. 3)  cegal   MSAEEP KSYP..VPYR SVLQEKF... EPGQTL
(Seq. ID No. 4)  eelgal  .SGGLQVK NFDFTVGKFL
(Seq. ID No. 5)  chgal3  YPGGPPGPYP GGPTAPYSEA PAAPLKVPYD LPLPAGLMPR LL.......I
(Seq. ID No. 6)  ragal4  AYVPAPGYQP TYNP.TLPYK RPIPGGL ..SVGMSI
(Seq. ID No. 7)  mugal4  PIPGGL.... SVGMSV
(Seq. ID No. 8)  hugal1  MACGLVAS NLNLKPGECL
(Seq. ID No. 9)  hugal4  M AYVPAPGYQP TYNP.TLPYY QPIPGGL. NVGMSV

                         151                                                200
(Seq. ID No. 10) cegal   IVKGSTID.. ESQRFTINLH SKTADFSGND VPLHVSVRFD E.G...K..I
(Seq. ID No. 11) eelgal  TVGGFINNSP ..QRFSVN.. .VGESM..NS LSLHLDHRFN .YGAD.QNTI
(Seq. ID No. 12) chgal3  TITGTVNSNP N..RFSLDFK R.GQD..... IAFHFNPRFK E...DHKRVI
(Seq. ID No. 13) ragal4  YIQGIAKD.. NMRRFHVNF. AVGQD.EGAD IAFHFNPRFD ..GWD.K..V
(Seq. ID No. 14) mugal4  YIQGMAKE.. NMRRFHVNF. AVGQD.DGAD VAFHFNPRFD ..GWD.K..V
(Seq. ID No. 15) hugal1  RVRGEV..AP DAKSFVLN.. .LGKDS..NN LCLHFNPRFN AHG.D.ANTI
(Seq. ID No. 16) hugal4  YIQGVASE.. HMKRFFVNF. VVGQD.PGSD VAFHFNPRFD ..GWD.K..V

                         201                                                250
(Seq. ID No. 17) cegal   VLNSF...SN GEWGKEERK. .SNPIKKGDS FDIRIRAHDD RFQI.IVDHK
(Seq. ID No. 18) eelgal  VMNSTLKGDN G.WETEQRST .NFTLSAGQY FEITLSYDIN KFYIDILDGP
(Seq. ID No. 19) chgal3  VCNSMF..QN N.WGKEERTA PRFPFEPGTP FKLQVLCEGD HFKVAV.NDA
(Seq. ID No. 20) ragal4  VFNTM...QS GQWGKEEKKK .SMPFQKGHH FELVFMVMSE HYKV.VVNGT
(Seq. ID No. 21) mugal4  VFKTM...QS GQWGKEEKKK .SMPFQKGKH FELVFMVMPE HYKV.VVNGN
(Seq. ID No. 22) hugal1  VCNS..K.DG GAWGTEQREA .VFPFQPGSV AEVCITFDQA NLTVKLPDGY
(Seq. ID No. 23) hugal4  VFNTL...QG GKWGSEERKR .SMPFKKGAA FELVFIVMAE HYKV.VVNGN

                         251                                                300
(Seq. ID No. 24) cegal   EFKDYEHR.L PLSSISHLSI DGDLYLNHV. .HW.GGK Y
(Seq. ID No. 25) eelgal  NL.EFPNR.Y SKEFLPFLSL AGDARLTLV. .K.E
(Seq. ID No. 26) chgal3  HLLQFNFREK KLNGITKLCI AGDITLTSVL TSMI
(Seq. ID No. 27) ragal4  PFYEYGHR.L PLQMVTHLQV DGDLELQSI. .NFLGGQPAA SQYPGTMTIP
(Seq. ID No. 28) mugal4  SFYEYGHR.L PVQMVTHLQV DGDLELQSI. .NFLGGQPAA APYAGAMTIP
(Seq. ID No. 29) hugal1  EF.KFPNR.L NLEAINYMAA DGDFKIKCV. .AFD
(Seq. ID No. 30) hugal4  PFYEYGHR.L PLQMVTHLQV DGDLQLQSI. .NFIGGQPLR PQ..GPPMMP

                         301                                                350
(Seq. ID No. 31) cegal   .YP V PYESGLANGL PVGKSLLVFG
                 eelgal  ---------- ---------- ---------- ---------- ----------
                 chgal3  ---------- ---------- ---------- ---------- ----------
(Seq. ID No. 32) ragal4  AYP..SAGYN PPQMNSLPVM AGPPIFNPPV PYVGTLQGGL TARRTIIIKG
(Seq. ID No. 33) mugal4  AYPAGSPGYN PPQMNTLPVM TGPPVFNPRV PYVGALQGGL TLPRTIIIKG
                 hugal1  ---------- ---------- ---------- ---------- ----------
(Seq. ID No. 34) hugal4  PYP..GPGHC HQQLNSLPTM EGPPTFNP.V PYFGRLQGGL TARRTIIIKG

                         351                                                400
(Seq. ID No. 35) cegal   TVEKKAKRFH VNL.LRKNGD ISFHFNPRFD EKHVIRNSLA ANEWGNEERE
                 eelgal  ---------- ---------- ---------- ---------- ----------
                 chgal3  ---------- ---------- ---------- ---------- ----------
(Seq. ID No. 36) ragal4  YVLPTAKNLI INFKVGSTGD IAFHMNPRIG D.CVVRNSYM NGSWGSEERK
(Seq. ID No. 37) mugal4  YVLPTARNFV INFKVGSSGD IALHLNPRIG D.SVVRNSFM NGSWGAEERK
                 hugal1  ---------- ---------- ---------- ---------- ----------
(Seq. ID No. 38) hugal4  YVPPTGKSFA INFKVGSSGD IALHINPH.G NGTVVRNSLL NGSWGSEEKK

                         401                                                450
(Seq. ID No. 39) cegal   .GKNPFEKGV G..FDLVIQN EEYAFQVFVN GERYISFAHR ADPHD.IAGL
                 eelgal  ---------- ---------- ---------- ---------- ----------
                 chgal3  ---------- ---------- ---------- ---------- ----------
(Seq. ID No. 40) ragal4  IPYNPF..GA GQFFDLSIRC GTDRFKVFAN GQHLFDFSHR FQAFQRVDML
(Seq. ID No. 41) mugal4  VAYNPF..GP GQFFDLSIRC GMDRFKVFAN GQHLFDFSHR FQAFQMVDTL
                 hugal1  ---------- ---------- ---------- ---------- ----------
(Seq. ID No. 42) hugal4  TTHNPF..GP GQFFDLSIRC GLDRFKVYAN GQHLFDFAHP SRAFQRVDTL

                         451          465
(Seq. ID No. 43) cegal   QISGDIELSG IQIQ
                 eelgal  ---------- -----
                 chgal3  ---------- -----
(Seq. ID No. 44) ragal4  EIKGDITLSY VQI
(Seq. ID No. 45) mugal4  EINGDITL
                 hugal1  ---------- -----
(Seq. ID No. 46) hugal4  EIQGDVTLSY VQI

(Seq. ID No. cCAGCCJACGAGCGGAAZ^TGGCAGACy^T^ 

GGAAACCCAAACCCTCAAGGATGGCCTGGCGCATGGGGGAACCAGCCTGCTGGGGCAGGG 
GGCTACCCAGGGGCTTCCTATCCTGGGGCCTACCCCGGGCAGGCACCCCCAGGGGCTTAT 
CCTGGACAGGCACCTCCAGGCGCCTACCATGGAGCACCTGGAGCTTATCCCGGAGCACCT 

gSccSScccag^^ 

Sgtgcccccggagcctaccctgccactggcccctatggcgcccctgctgggccactg 
a^gccttataacctgcctttgcctgggggagtggtgcctcgcatgctgataacaatt 

Ai I _i„ - ^^mm « »rir.i ziTTrirTTTAGATTTCCAAAGAGGGAATGAT 




TATTACAGTGAAAG 






cegal  - nematode
eelgal - eel
chgal3 - chicken
ragal4 - rat
mugal4 - mouse
hugal1 - human
hugal4 - human



In another aspect of the present invention, we contemplate the identification of patterns of galectin 
expression so as to identify a disease state. When the expression of all members of the galectin family is analyzed, a
predisposition for various diseases can be ascertained. In particular, the complete expression pattern of the galectin
family in all normal and tumor tissues will indicate both preneoplastic and postneoplastic events.

By one approach, diagnostic tools which simultaneously screen for all galectin family members are
developed. Current technology, such as gene array scanning systems, permits the execution of our strategy due to its
inherent large-scale screening abilities, accuracy, generation of quantitative information and efficiency in both
preparation of diagnostic arrays and data scanning and accumulation. Accordingly, the approaches to monitoring the
expression of galectin-3 detailed above (e.g., the protein and RNA detection methods) can be modified so that the
expression of several galectins from numerous tissues or biological sample sources can be simultaneously analyzed.
The use of traditional techniques, such as antibody-mediated serum assays, immunohistochemistry, ELISA, genechip
northerns, and PCR will permit us to screen for multiple galectins rapidly. Additionally, the quantity of each galectin is
preferably determined. The amount of RNA or protein corresponding to the various galectins can be quantified by the
approaches detailed above or by many other methods known to those of skill in the art. For example, the total protein
or RNA in a sample is first determined by conventional techniques (e.g., by spectroscopy); and then by using a probe
having a label with a known specific activity, the amount of galectin expression can be accurately determined and
normalized to a "house keeping marker."
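
The arithmetic behind this normalization might be sketched as follows, assuming a radiolabeled probe whose specific activity is expressed in counts per minute per picomole. The units, function names, and example values are assumptions of this sketch and are not specified herein.

```python
def bound_probe_pmol(counts_cpm, specific_activity_cpm_per_pmol):
    """Convert measured counts into picomoles of bound probe."""
    if specific_activity_cpm_per_pmol <= 0:
        raise ValueError("specific activity must be positive")
    return counts_cpm / specific_activity_cpm_per_pmol


def normalized_expression(galectin_pmol, housekeeping_pmol):
    """Express the galectin signal relative to the housekeeping marker."""
    if housekeeping_pmol <= 0:
        raise ValueError("housekeeping signal must be positive")
    return galectin_pmol / housekeeping_pmol


# Hypothetical readings from one sample.
galectin = bound_probe_pmol(5000, 1000)        # 5.0 pmol of galectin probe bound
housekeeping = bound_probe_pmol(20000, 2000)   # 10.0 pmol of housekeeping probe bound
relative_level = normalized_expression(galectin, housekeeping)
```

Dividing by the housekeeping signal removes differences in the total amount of RNA or protein loaded, so that values from different samples can be compared.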

Once the levels of the various galectins are determined, the information is recorded onto a computer readable
medium, such as a hard drive, floppy disk, DVD drive, zip drive, etc. After recording and the generation of a database
comprising the levels of expression of the various galectins studied, a comparing program is used which compares the
levels of expression of the various galectins so as to create a ratio of expression. In a first comparison, a galectin to
galectin ratio is generated. For example, desirable galectin to galectin ratios include, but are not limited to:
galectin-1:galectin-3, galectin-1:galectin-5, galectin-1:galectin-7, galectin-1:galectin-8, galectin-1:galectin-9,
galectin-3:galectin-5, galectin-3:galectin-7, galectin-3:galectin-8, galectin-3:galectin-9, galectin-5:galectin-7,
galectin-5:galectin-8, galectin-5:galectin-9, galectin-7:galectin-8, galectin-7:galectin-9, and galectin-8:galectin-9. As other
newly found galectins are identified, we contemplate screening for their expression patterns and incorporating the
levels of expression into our ratio analysis approach to disease prognosis. In a second comparison, the galectin to
galectin ratios from normal subjects are compared to galectin to galectin ratios of subjects having various diseases.
Preferably, the diseased subjects studied initially have been identified as having a form of cancer. Desirably, several
databases are generated comprising the galectin to galectin ratios from normal individuals and the galectin to galectin
ratios from diseased subjects so that a statistical analysis can be accurately performed. In this manner patterns of
galectin expression are analyzed and the predisposition to galectin related disease is determined.
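
A comparing program of the kind described might, in its simplest form, generate every galectin to galectin ratio from a set of recorded expression levels. The sketch below is illustrative only; the function name and the example levels are hypothetical. For the six galectins named above it produces the same fifteen pairings listed in the preceding paragraph.

```python
from itertools import combinations


def galectin_ratios(levels):
    """Return every pairwise ratio of expression levels.

    levels maps a galectin number (e.g. 3 for galectin-3) to its normalized
    expression level; all levels are assumed to be non-zero.
    """
    ratios = {}
    for a, b in combinations(sorted(levels), 2):
        ratios[f"galectin-{a}:galectin-{b}"] = levels[a] / levels[b]
    return ratios


# Hypothetical normalized expression levels for one sample.
sample_levels = {1: 2.0, 3: 8.0, 5: 1.0, 7: 4.0, 8: 0.5, 9: 2.0}
sample_ratios = galectin_ratios(sample_levels)
```

The same function applied to samples from normal and diseased subjects yields the two sets of ratios that are compared in the second comparison.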

Individuals who wish to be screened for a predisposition for a galectin-related disease can have a biological 
sample taken and the RNA or protein present in the sample analyzed for the galectin expression pattern, and the
galectin to galectin ratios are compared to galectin to galectin ratios exhibited by both normal and diseased subjects.
Depending on the comparison of the normal and diseased ratios, a health care practitioner can choose an appropriate
course of treatment and/or prevention. 

In the following discussion, we provide methods and compositions which are used to treat and/or prevent 
diseases associated with increased levels of galectin expression.

Therapeutic and prophylactic agents

In addition to the diagnosis of liver disease and galectin-3 related diseases, galectin-3-specific antibodies are
useful for the treatment of conditions and diseases associated with expression of galectin-3. Such antibodies include,
but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab fragments and fragments produced by a Fab
expression library. Neutralizing antibodies, i.e., those which inhibit dimer formation, are especially preferred for
diagnostics and therapeutics.

Galectin-3 protein to be used for antibody production need not retain biological activity; however, the
protein, fragment, or oligopeptide is desirably antigenic. Peptides used to induce specific antibodies may have an
amino acid sequence consisting of at least five amino acids, preferably at least 10 amino acids. Preferably, they
should mimic a portion of the amino acid sequence of the natural protein and may contain the entire amino acid
sequence of a small, naturally occurring molecule. Short stretches of galectin-3 amino acids may be fused with those
of another protein such as keyhole limpet hemocyanin and antibodies produced against the chimeric molecule.

For the production of antibodies, various hosts including goats, rabbits, rats, mice, etc., may be immunized by
injection with galectin-3 or any portion, fragment or oligopeptide which retains immunogenic properties. Depending on
the host species, various adjuvants may be used to increase immunological response. Such adjuvants include but are
not limited to Freund's, mineral gels such as aluminum hydroxide, and surface active substances such as lysolecithin,
pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and dinitrophenol. BCG (Bacillus
Calmette-Guerin) and Corynebacterium parvum are potentially useful adjuvants.

Monoclonal antibodies to galectin-3 may be prepared using any technique which provides for the production
of antibody molecules by continuous cell lines in culture. These include but are not limited to the hybridoma technique
originally described by Koehler and Milstein (Nature 256:495-497 (1975)), the human B-cell hybridoma technique
(Kosbor et al. Immunol Today 4:72 (1983); Cote et al. Proc Natl Acad Sci 80:2026-2030 (1983)), and the EBV-hybridoma
technique (Cole et al. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss Inc., New York N.Y., pp 77-96
(1985)).

In addition, techniques developed for the production of "chimeric antibodies", the splicing of mouse antibody
genes to human antibody genes to obtain a molecule with appropriate antigen specificity and biological activity, can be
used (Morrison et al. Proc Natl Acad Sci 81:6851-6855 (1984); Neuberger et al. Nature 312:604-608 (1984);
Takeda et al. Nature 314:452-454 (1985)). Alternatively, techniques described for the production of single chain
antibodies (U.S. Pat. No. 4,946,778) can be adapted to produce galectin-3-specific single chain antibodies.

Antibodies may also be produced by inducing in vivo production in the lymphocyte population or by screening
recombinant immunoglobulin libraries or panels of highly specific binding reagents as disclosed in Orlandi et al., Proc
Natl Acad Sci 86:3833-3837 (1989), and Winter G. and Milstein C., Nature 349:293-299 (1991).

Antibody fragments which contain specific binding sites for galectin-3 may also be generated. For example,
such fragments include, but are not limited to, the F(ab')2 fragments which can be produced by pepsin digestion of the
antibody molecule and the Fab fragments which can be generated by reducing the disulfide bridges of the F(ab')2
fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy identification of
monoclonal Fab fragments with the desired specificity (Huse W. D. et al. Science 256:1275-1281 (1989)).

By "antibody therapy" is meant the administration of an antibody, an antibody conjugate, or an antibody
heteroconjugate to a subject in need for the purpose of treating or preventing a disease. Many forms of antibody
therapy are known in the art including, but not limited to, the administration of monoclonal or humanized monoclonal
antibodies, the administration of toxin or radionuclide conjugated antibodies, and the administration of monoclonal
antibody heteroconjugates having one domain that binds to a disease-related antigen (e.g., a galectin such as
galectin-3) and another domain that binds to the Fc region of IgG. Further, the term antibody therapy is meant to include the
administration of F(ab')2 or Fab fragments with or without conjugated toxins or radionuclides. Several examples of
antibody therapy are found in Recent Results in Cancer Research vol. 141: Systemic Radiotherapy with Monoclonal
Antibodies, edited by M.L. Sautter-Bihl and M. Wannenmacher, Springer-Verlag publishers, 1996.

Radiolabeled mAbs specific for a tumor antigen (e.g., a galectin such as galectin-3) are prepared by labeling
the antibody with an isotope or combinations of isotopes, such as ¹³¹I, ⁹⁰Y, ⁶⁷Cu, ¹⁸⁸Re, ¹⁸⁶Re, ²¹²Bi or ²¹¹At. Preferable
radiolabeled mAbs are able to deliver more than 6000 rads to the tumor, and have sufficient affinity so that the
patient's bone marrow is not exposed to more than 300 rads. We contemplate the use of ¹³¹I labeled anti-galectin-3,
as well as other radiolabeled mAbs, for treatment of HCC and cirrhosis of the liver. In this aspect and in several other
embodiments using radionuclide or toxin conjugated antibodies, a delivery system which preferentially allows delivery
of the antibody therapy agent to diseased cells is desired. By one approach, a liposome having the antibody therapy
agent and a membrane bound protein which interacts specifically with liver cells is used. Additionally, the localized
delivery of the antibody therapy agent (e.g., injecting the agent directly into liver tissue) is contemplated.

Further, the development of peptidomimetics and other compounds which interfere with galectin-3 function
so as to generate pharmaceuticals which can be used to treat galectin-3 related liver diseases are embodiments of the
invention. Galectin-3, its catalytic or immunogenic fragments or oligopeptides thereof, can be used for screening
therapeutic compounds in any of a variety of drug screening techniques. The fragment employed in such a test may
be free in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of
binding complexes between human galectin-3 and the agent being tested may be measured. Another technique for
drug screening which may be used for high throughput screening of compounds having suitable binding affinity to
galectin-3 is described in detail in "Determination of Amino Acid Sequence Antigenicity" by Geysen H. N., WO
Application 84/03564, published on Sep. 13, 1984, and incorporated herein by reference. In summary, large numbers
of different small peptide test compounds are synthesized on a solid substrate, such as plastic pins or some other
surface. The peptide test compounds are reacted with fragments of galectin-3 and washed. Bound galectin-3 is then
detected by methods well known in the art. Substantially purified galectin-3 can also be coated directly onto plates
for use in the aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies can be used to
capture the peptide and immobilize it on a solid support, or many of the approaches for attaching a biomolecule to a
support, described above, can be used. Once test compounds are found, they are preferably tested in a test subject
having liver disease such as HCC and cirrhosis of the liver and the effect on galectin-3 activity is analyzed by one or
more of the approaches detailed above. Successful agents are then incorporated into pharmaceuticals and are used to
treat and/or prevent liver diseases such as HCC and cirrhosis of the liver.

Additionally, we contemplate the use of antisense and ribozyme technology to inhibit the expression of galectin-3
and/or other galectins. The antisense nucleic acid molecules to be used in gene therapy may be either DNA or RNA
sequences. They may comprise a sequence complementary to the sequence of mRNA encoding galectin-3. The antisense
nucleic acids should have a length and melting temperature sufficient to permit formation of an intracellular duplex having
sufficient stability to inhibit the expression of the mRNA in the duplex. Strategies for designing antisense nucleic acids
suitable for use in gene therapy are disclosed in Green et al., Ann. Rev. Biochem., 55:569-597 (1986) and Izant and
Weintraub, Cell, 36:1007-1015 (1984).

In some strategies, antisense molecules are obtained from a nucleotide sequence encoding galectin-3 by reversing
the orientation of the coding region with respect to a promoter so as to transcribe the opposite strand from that which is
normally transcribed in the cell. Antisense molecules and ribozymes may be prepared by any method known in the art
for the synthesis of RNA molecules. These include techniques for chemically synthesizing oligonucleotides such as
solid phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in
vivo transcription of DNA sequences encoding galectin-3 or another galectin. Such DNA sequences may be
incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6.
Alternatively, antisense cDNA constructs that synthesize antisense RNA constitutively or inducibly can be introduced
into cell lines, cells or tissues.

Alternatively, oligonucleotides which are complementary to the mRNA encoding galectin-3 may be synthesized in
vitro. Thus, antisense nucleic acids are capable of hybridizing to the galectin-3 mRNA to create a duplex. In some
embodiments, the antisense sequences may contain modified sugar phosphate backbones to increase stability and make
them less sensitive to RNase activity. Possible modifications include, but are not limited to, the addition of flanking
sequences at the 5' and/or 3' ends of the molecule or the use of phosphorothioate or 2' O-methyl rather than
phosphodiester linkages within the backbone of the molecule. This concept is inherent in the production of PNAs
and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queuosine and
wybutosine as well as acetyl-, methyl-, thio- and similarly modified forms of adenine, cytidine, guanine, thymine, and
uridine which are not as easily recognized by endogenous endonucleases. Further examples are described by Rossi et
al., Pharmacol. Ther., 50(2):245-254 (1991).
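
To make the notion of an oligonucleotide "complementary to the mRNA" concrete, the short sketch below derives an antisense DNA oligonucleotide from a stretch of mRNA by taking its reverse complement. The function name and the example sequence are hypothetical; the sequence is a placeholder and is not taken from the galectin-3 transcript.

```python
# Watson-Crick partners of each mRNA base, written as DNA bases.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}


def antisense_dna(mrna_segment):
    """Return the antisense DNA oligonucleotide (5'->3') for an mRNA segment (5'->3')."""
    seq = mrna_segment.upper()
    unknown = set(seq) - set(COMPLEMENT)
    if unknown:
        raise ValueError(f"unexpected base(s) in mRNA segment: {sorted(unknown)}")
    # Reverse the segment, then complement each base, so that the result
    # pairs antiparallel with the target mRNA.
    return "".join(COMPLEMENT[base] for base in reversed(seq))


if __name__ == "__main__":
    target = "AUGGCA"             # placeholder mRNA segment, not a real galectin-3 sequence
    print(antisense_dna(target))  # TGCCAT
```

Backbone modifications such as phosphorothioate linkages change the chemistry of the oligonucleotide but not the base sequence computed here.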

Various types of antisense oligonucleotides complementary to the galectin-3 mRNA may be used. In one
preferred embodiment, stable and semi-stable antisense oligonucleotides described in International Application No. PCT
WO94/23026, hereby incorporated by reference, are used. In these molecules, the 3' end or both the 3' and 5' ends are
engaged in intramolecular hydrogen bonding between complementary base pairs. These molecules are better able to
withstand exonuclease attacks and exhibit increased stability compared to conventional antisense oligonucleotides. In
another preferred embodiment, the antisense oligodeoxynucleotides described in International Application No. WO
95/04141 are used.

In yet another preferred embodiment, the covalently cross-linked antisense oligonucleotides described in
International Application No. WO 96/31523 are used. These double- or single-stranded oligonucleotides comprise one or
more, respectively, inter- or intra-oligonucleotide covalent cross-linkages, wherein the linkage consists of an amide bond
between a primary amine group of one strand and a carboxyl group of the other strand or of the same strand, respectively,
the primary amine group being directly substituted in the 2' position of the strand nucleotide monosaccharide ring, and the
carboxyl group being carried by an aliphatic spacer group substituted on a nucleotide or nucleotide analog of the other
strand or the same strand, respectively.

The antisense oligodeoxynucleotides and oligonucleotides disclosed in International Application No. WO
92/18522 may also be used. These molecules are stable to degradation and contain at least one transcription control
recognition sequence which binds to control proteins and are effective as decoys therefor. These molecules may contain
"hairpin" structures, "dumbbell" structures, "modified dumbbell" structures, "cross-linked" decoy structures and "loop"
structures. In another preferred embodiment, the cyclic double-stranded oligonucleotides described in European Patent
Application No. 0 572 287 A2 are used. These ligated oligonucleotide "dumbbells" contain the binding site for a
transcription factor and inhibit expression of the gene under control of the transcription factor by sequestering the factor.
Use of the closed antisense oligonucleotides disclosed in International Application No. WO 92/19732, hereby incorporated
by reference, is also contemplated. Because these molecules have no free ends, they are more resistant to degradation by
exonucleases than are conventional oligonucleotides. These oligonucleotides may be multifunctional, interacting with
several regions which are not adjacent to the target mRNA.

The appropriate level of antisense nucleic acids required to inhibit gene expression may be determined using in
vitro expression analysis. The antisense molecule may be introduced into the cells expressing galectin-3 by diffusion,
injection, infection or transfection using procedures known in the art. For example, the antisense nucleic acids can be
introduced into the body as a bare or naked oligonucleotide, oligonucleotide encapsulated in lipid, oligonucleotide sequence
encapsidated by viral protein, or as an oligonucleotide operably linked to a promoter contained in an expression vector. The
expression vector may be any of a variety of expression vectors known in the art, including retroviral or viral vectors,
vectors capable of extrachromosomal replication, or integrating vectors. The vectors may be DNA or RNA.

The antisense molecules are introduced onto cell samples at a number of different concentrations, preferably
between 1x10⁻¹⁰ M and 1x10⁻⁴ M. Once the minimum concentration that can adequately control gene expression is identified,
the optimized dose is translated into a dosage suitable for use in vivo. For example, an inhibiting concentration in culture of
1x10⁻⁷ M translates into a dose of approximately 0.6 mg/kg body weight. Levels of oligonucleotide approaching 100 mg/kg
body weight or higher may be possible after testing the toxicity of the oligonucleotide in laboratory animals. It is
additionally contemplated that cells from a vertebrate, such as a mammal or human, are removed, treated with the
antisense oligonucleotide, and reintroduced into the vertebrate.
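
The text does not state how the culture concentration is converted into a body-weight dose. One back-of-envelope reading that reproduces the figure of approximately 0.6 mg/kg is sketched below. It assumes an oligonucleotide molecular weight of about 6,000 g/mol and a distribution volume of about 1 L per kg of body weight; both values, and the function names, are assumptions introduced here for illustration.

```python
def dilution_series(high_exponent=-4, low_exponent=-10):
    """Ten-fold concentration steps in mol/L spanning the preferred test range."""
    return [10.0 ** e for e in range(low_exponent, high_exponent + 1)]


def molar_to_mg_per_kg(conc_molar, mol_weight_g_per_mol=6000.0, volume_l_per_kg=1.0):
    """Convert a molar concentration into an approximate dose in mg per kg body weight.

    mol/L * g/mol = g/L; * 1000 = mg/L; * L/kg = mg/kg.
    The default molecular weight and distribution volume are illustrative assumptions.
    """
    return conc_molar * mol_weight_g_per_mol * 1000.0 * volume_l_per_kg


if __name__ == "__main__":
    for conc in dilution_series():
        print(f"{conc:.0e} M -> {molar_to_mg_per_kg(conc):.4g} mg/kg")
```

Under these assumptions a culture concentration of 1x10⁻⁷ M corresponds to about 0.6 mg/kg. Any actual dose would depend on the real molecular weight, pharmacokinetics and toxicity data for the oligonucleotide concerned.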

Ribozymes may also be used to reduce or eliminate galectin-3 expression. Ribozymes are enzymatic RNA
molecules capable of catalyzing the specific cleavage of RNA. The mechanism of ribozyme action involves
sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage.
Within the scope of aspects of the invention are engineered hammerhead motif ribozyme molecules that can
specifically and efficiently catalyze endonucleolytic cleavage of a sequence encoding human galectin-3, for example.
Specific ribozyme cleavage sites within any potential RNA target are initially identified by scanning the target
molecule for ribozyme cleavage sites which include the following sequences: GUA, GUU and GUC. Once identified,
short RNA sequences of between 15 and 20 ribonucleotides corresponding to the region of the target gene containing
the cleavage site may be evaluated for secondary structural features which may render the oligonucleotide inoperable.
The suitability of candidate targets may also be evaluated by testing accessibility to hybridization with complementary
oligonucleotides using ribonuclease protection assays. Delivery of antisense and ribozyme agents by transfection and
by liposome is quite well known in the art.
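
The initial scan for candidate cleavage sites lends itself to a simple illustration. The sketch below locates the GUA, GUU and GUC triplets in a target RNA and extracts a window of 15 to 20 ribonucleotides around each one for later evaluation. The function names, the default window length of 17 and the test sequence are hypothetical, and the secondary-structure and accessibility checks mentioned in the text are not implemented.

```python
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def find_cleavage_sites(rna):
    """Return (index, triplet) for every candidate cleavage triplet in the RNA."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    return [
        (i, rna[i:i + 3])
        for i in range(len(rna) - 2)
        if rna[i:i + 3] in CLEAVAGE_TRIPLETS
    ]


def candidate_windows(rna, length=17):
    """Return (index, triplet, window) with a window of `length` nt around each site."""
    if not 15 <= length <= 20:
        raise ValueError("window length should be between 15 and 20 nucleotides")
    rna = rna.upper().replace("T", "U")
    flank = (length - 3) // 2
    windows = []
    for i, triplet in find_cleavage_sites(rna):
        start = max(0, i - flank)
        end = min(len(rna), start + length)
        start = max(0, end - length)  # shift left when the site is near the 3' end
        windows.append((i, triplet, rna[start:end]))
    return windows


if __name__ == "__main__":
    placeholder = "A" * 10 + "GUA" + "C" * 10  # not a real galectin-3 sequence
    for index, triplet, window in candidate_windows(placeholder):
        print(index, triplet, window)
```

Each window returned here would still need to be screened for secondary structure and tested for accessibility, for example by ribonuclease protection assays, before being selected as a target.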

The pharmacologically active compounds of this invention can be processed in accordance with conventional
methods of galenic pharmacy to produce medicinal agents for administration to patients, e.g., mammals including
humans. The antibodies, fragments thereof, peptidomimetics, antisense, and ribozymes may be incorporated into a
pharmaceutical product with and/or without modification. Further, we envision the manufacture of pharmaceuticals
or therapeutic agents which deliver anti-galectin-3 agents or a nucleic acid sequence encoding the anti-galectin-3
agent by several routes. For example, and not by way of limitation, the use of DNA, RNA, and viral vectors having
sequence encoding an antibody to galectin-3 is contemplated. Nucleic acids encoding an antibody to galectin-3 can be
administered alone or in combination with peptides or an antibody to galectin-3.

The compounds of this invention can be employed in admixture with conventional excipients, i.e.,
pharmaceutically acceptable organic or inorganic carrier substances suitable for parenteral, enteral (e.g., oral) or
topical application which do not deleteriously react with the active compounds. Suitable pharmaceutically acceptable
carriers include but are not limited to water, salt solutions, alcohols, gum arabic, vegetable oils, benzyl alcohols,
polyethylene glycols, gelatine, carbohydrates such as lactose, amylose or starch, magnesium stearate, talc, silicic acid,
viscous paraffin, perfume oil, fatty acid monoglycerides and diglycerides, pentaerythritol fatty acid esters, hydroxy
methylcellulose, polyvinyl pyrrolidone, etc. The pharmaceutical preparations can be sterilized and if desired mixed
with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing
osmotic pressure, buffers, coloring, flavoring and/or aromatic substances and the like which do not deleteriously react
with the active compounds. They can also be combined where desired with other active agents, e.g., vitamins.

For parenteral application, particularly suitable are injectable, sterile solutions, preferably oily or aqueous
solutions, as well as suspensions, emulsions, or implants, including suppositories. Ampoules are convenient unit
dosages.

For enteral application, particularly suitable are tablets, dragees, liquids, drops, suppositories, or capsules. A
syrup, elixir, or the like can be used wherein a sweetened vehicle is employed.

Sustained or directed release compositions can be formulated, e.g., liposomes or those wherein the active
compound is protected with differentially degradable coatings, e.g., by microencapsulation, multiple coatings, etc. It
is also possible to freeze-dry the new compounds and use the lyophilizates obtained, for example, for the preparation
of products for injection.

For topical application, there are employed as non-sprayable forms, viscous to semi-solid or solid forms
comprising a carrier compatible with topical application and having a dynamic viscosity preferably greater than water.
Suitable formulations include but are not limited to solutions, suspensions, emulsions, creams, ointments, powders,
liniments, salves, aerosols, etc., which are, if desired, sterilized or mixed with auxiliary agents, e.g., preservatives,
stabilizers, wetting agents, buffers or salts for influencing osmotic pressure, etc. For topical application, also suitable
are sprayable aerosol preparations wherein the active ingredient, preferably in combination with a solid or liquid inert
carrier material, is packaged in a squeeze bottle or in admixture with a pressurized volatile, normally gaseous
propellant, e.g., a freon.

It will be appreciated that the actual preferred amounts of active compound in a specific case will vary
according to the specific compound being utilized, the particular compositions formulated, the mode of application, and
the particular situs and organism being treated. Pharmaceutical compositions suitable for use in the present invention
include compositions wherein the active ingredients are contained in an effective amount to achieve the intended
purpose. Dosages for a given host can be determined using conventional considerations, e.g., by customary
comparison of the differential activities of the subject compounds and of a known agent, e.g., by means of an
appropriate, conventional pharmacological protocol. The determination of an effective dose is well within the
capability of those skilled in the art.

For any compound, the therapeutically effective dose can be estimated initially either in cell culture assays
or in animal models, usually mice, rabbits, dogs, or pigs. The animal model is also used to achieve a desirable
concentration range and route of administration. Such information can then be used to determine useful doses and
routes for administration in humans.

A therapeutically effective dose refers to that amount of protein or its antibodies, antagonists, or inhibitors
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity of such compounds can be determined
by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., ED50 (the dose therapeutically
effective in 50% of the population) and LD50 (the dose lethal to 50% of the population). The dose ratio of toxic to
therapeutic effects is the therapeutic index, and it can be expressed as the ratio, LD50/ED50. Pharmaceutical
compositions which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and
animal studies are used in formulating a range of dosage for human use. The dosage of such compounds lies preferably
within a range of circulating concentrations that include the ED50 with little or no toxicity. The dosage varies within
this range depending upon the dosage form employed, sensitivity of the patient, and the route of administration.
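
As a worked illustration of the therapeutic index defined above, the sketch below computes LD50/ED50 and ranks candidate compounds by that ratio. The compound names and dose values are invented placeholders, not data from this disclosure.

```python
def therapeutic_index(ld50, ed50):
    """Return the therapeutic index LD50/ED50 (both doses in the same units)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


def rank_by_therapeutic_index(compounds):
    """Sort {name: (ld50, ed50)} from the largest to the smallest therapeutic index."""
    return sorted(
        ((name, therapeutic_index(ld50, ed50)) for name, (ld50, ed50) in compounds.items()),
        key=lambda item: item[1],
        reverse=True,
    )


if __name__ == "__main__":
    # Hypothetical values in mg/kg, for illustration only.
    candidates = {"compound A": (500.0, 25.0), "compound B": (300.0, 60.0)}
    for name, index in rank_by_therapeutic_index(candidates):
        print(name, index)
```

With these placeholder values compound A has an index of 20 and compound B an index of 5, so compound A would be preferred on this criterion alone.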

The exact dosage is chosen by the individual physician in view of the patient to be treated. Dosage and
administration are adjusted to provide sufficient levels of the active moiety or to maintain the desired effect.
Additional factors which may be taken into account include the severity of the disease state, e.g., tumor size and
location; age, weight and gender of the patient; diet, time and frequency of administration, drug combination(s),
reaction sensitivities, and tolerance/response to therapy. Long acting pharmaceutical compositions might be
administered every 3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate of the
particular formulation.

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total dose of about 10 g,
depending upon the route of administration. Guidance as to particular dosages and methods of delivery is provided in
the literature. See U.S. Pat. Nos. 4,657,760; 5,206,344; or 5,225,212. Those skilled in the art will employ different
formulations for nucleotides than for proteins or their inhibitors. Similarly, delivery of polynucleotides or polypeptides
will be specific to particular cells, conditions, locations, etc.

Although the invention has been described with reference to embodiments and examples, it should be
understood that various modifications can be made without departing from the spirit of the invention. Accordingly,
the invention is limited only by the following claims. All references cited herein are hereby expressly incorporated by
reference.






WO 00/62076 PCT/US00/08561 



WHAT IS CLAIMED IS: 

1. A method of identifying the presence of galectin-3 in cirrhotic or cancerous liver comprising:
obtaining a sample of cirrhotic or cancerous liver tissue;
providing an antibody to galectin-3;
contacting the sample with the antibody; and
analyzing the presence of galectin-3 in the sample.

2. The method of Claim 1, wherein the cancerous liver tissue is hepatocellular carcinoma.

3. The method of Claim 1, wherein the analysis of the presence of galectin-3 in the sample comprises
a technique of the group consisting of fluorescence activated cell sorting (FACS), immunoprecipitation, Western blot,
antibody staining, and a galectin-3 binding assay.

4. A method of identifying the amount of galectin-3 expression in cirrhotic or cancerous liver comprising:
obtaining a sample of cirrhotic or cancerous liver tissue;
providing a probe which interacts with galectin-3 or an RNA encoding galectin-3;
contacting the sample with the probe; and
analyzing the presence of galectin-3 in the sample.

5. The method of Claim 4, wherein the cancerous liver tissue is hepatocellular carcinoma.

6. The method of Claim 4, wherein the probe is a member of the group consisting of a nucleic acid, a
protein, a carbohydrate chemical agent, and a peptidomimetic.

7. The method of Claim 4, wherein the analysis of the presence of galectin-3 in the sample comprises
a technique of the group consisting of fluorescence activated cell sorting (FACS), immunoprecipitation, Western blot,
antibody staining, a galectin-3 binding assay, and a hybridization assay.

8. The method of Claim 4 further comprising quantifying the amount of galectin-3 protein or RNA in
the sample.

9. The method of Claim 4 or 8 further comprising recording the amount of galectin-3 expression or the
amount of galectin-3 on a computer readable media.

10. A method of identifying a subject in need of treatment or prevention of liver disease comprising:
obtaining a biological sample from a subject having RNA or protein, wherein the biological sample is
obtained from the subject's liver;
providing a probe which interacts with galectin-3 protein or RNA encoding galectin-3;
contacting the biological sample with the probe under conditions which allow the probe to interact
with the RNA or protein in the biological sample;
detecting the amount of probe which interacts with the RNA or protein in the biological sample so
as to determine the amount of galectin-3 expression;
identifying the subject as a subject in need of treatment or prevention of liver disease by the
presence of the probe in hepatocytes.

11. The method of Claim 10, wherein the liver disease is cirrhosis or hepatocellular carcinoma, or any
neoplasm originating from the liver.

12. The method of Claim 10, wherein the biological sample comprises an hepatocyte.

13. The method of Claim 10, wherein the probe is a member of the group consisting of a nucleic acid, a
protein, a carbohydrate chemical agent, and a peptidomimetic.

14. The method of Claim 10, wherein the detection of the amount of probe which interacts with the
RNA or protein comprises a technique of the group consisting of fluorescence activated cell sorting (FACS),
immunoprecipitation, Western blot, antibody staining, a galectin-3 binding assay, and a hybridization assay.

15. The method of Claim 10 further comprising quantifying the amount of galectin-3 protein or RNA in
the sample.

16. The method of Claim 10 or 15 further comprising recording the amount of galectin-3 expression on
a computer readable media.

17. A method of identifying a predisposition to disease comprising:
obtaining a biological sample from a subject, wherein the biological sample has protein or RNA;
providing a plurality of probes which interact with a plurality of galectin proteins or a plurality of
RNA sequences encoding a plurality of galectin proteins;
contacting the protein or RNA from the biological sample with the plurality of probes under
conditions which allow the probes to interact with galectins which may be present in the biological sample;
detecting the probes which interact with the RNA or protein in the sample so as to determine a
level of expression of more than one galectin; and
analyzing the level of expression of more than one galectin to identify the predisposition for
disease.

18. The method of Claim 17, wherein the plurality of probes interact with at least two proteins of the
group consisting of galectin-1, galectin-2, galectin-3, galectin-4, galectin-5, galectin-5H, galectin-6, galectin-7,
galectin-8, galectin-9, and galectin-10 or two RNAs having the entire coding sequence or a sequence within the entire
coding sequence of an RNA encoding a member of the group consisting of galectin-1, galectin-3, galectin-4, galectin-5,
galectin-5H, galectin-6, galectin-7, galectin-8, galectin-9, and galectin-10.

19. The method of Claim 17, wherein the disease is cirrhosis or hepatocellular carcinoma, or any
neoplasm originating from the liver.

20. The method of Claim 17, wherein the biological sample comprises an hepatocyte.

21. The method of Claim 17, wherein the plurality of probes have at least two members of the group
consisting of a nucleic acid, a protein, a carbohydrate chemical agent, and a peptidomimetic.

22. The method of Claim 17, wherein the detection of the amount of probe which interacts with the
RNA or protein comprises a technique of the group consisting of fluorescence activated cell sorting (FACS),
immunoprecipitation, Western blot, antibody staining, a galectin binding assay, and a hybridization assay.

23. The method of Claim 17 further comprising quantifying the amount of galectin protein or RNA in
the sample.

24. The method of Claim 17 further comprising calculating a ratio of galectin expression, wherein the
ratio comprises the level of expression of a first galectin compared to the level of expression of a second galectin or
the ratio comprises the level of expression of a galectin from a biological sample of a first subject compared to the
level of expression of a galectin from a biological sample from a second subject.

25. The method of Claim 17, 23, or 24 further comprising recording the amount of galectin expression 
on a computer readable media. 

26. A method of making a pharmaceutical for the treatment or prevention of hepatocellular carcinoma 
10 comprising incorporating an agent which inhibits the prevalence of galectin-3 in liver cells or tissue in a pharmaceutical 

formulation, wherein the agent is a member of the group consisting of an antibody to galectin-3, a peptide, a 
carbohydrate, a chemical agent, a peptidomimetic which interacts with galectin-3, an antisense oligonucleotide 
complementary to a transcript encoding galectin-3, and a ribozyme complementary to a transcript encoding galectin-3. , 

27. A method of treatment or prevention of hepatocellular carcinoma or cirrhosis of the liver 
15 comprising administering an agent which inhibits the prevalence of galectin-3 in liver cells or tissue in a 

pharmaceutical formulation, wherein the agent is a member of the group consisting of an antibody to galectin-3, a 
carbohydrate, a chemical agent, a peptide a peptidomimetic which interacts with galectin-3, an antisense 
oligonucleotide complementary to a transcript encoding galectin-3, and a ribozyme complementary to a transcript 
encoding galectin-3. 

20 28. The method of Claims 26 or 27, wherein the agent is an antibody conjugated to a toxin or a 

radionuclide. 

29. A method of screening for prognosis of cirrhosis or neoplasm originating from liver in a human 
comprising: 

(a) isolating test proteins or RNA from a biological sample having hepatocytes, said 
biological sample being obtained from said subject; and

(b) determining the level of galectin-3 protein or an RNA encoding galectin-3, wherein an 
elevated level of said galectin-3 protein or RNA encoding galectin-3 indicates a positive correlation with 
development of cancer or cirrhosis of the liver, and wherein the step of determining the level of galectin-3 
protein or an RNA encoding galectin-3 is carried out by probing said galectin-3 protein or RNA encoding
galectin-3 with an antibody which recognizes galectin-3 protein or a nucleic acid complementary to the RNA
encoding galectin-3.

30. A method of screening for risk of cancer or cirrhosis of the liver in a human subject comprising: 

(a) isolating test proteins or RNA from a biological sample having hepatocytes, said 
biological sample being obtained from said subject;

(b) determining the level of galectin-3 protein or an RNA encoding galectin-3, wherein an
elevated level of said galectin-3 protein or RNA encoding galectin-3 indicates a positive correlation with
development of cancer or cirrhosis of the liver, and wherein the step of determining the level of galectin-3
protein or an RNA encoding galectin-3 is carried out by probing said galectin-3 protein or RNA encoding
galectin-3 with an antibody which recognizes galectin-3 protein or a nucleic acid complementary to the RNA
encoding galectin-3; and

(c) identifying the human subject as being at risk based on the level of galectin-3 protein or RNA
encoding galectin-3 determined in step (b). 

31. The method of Claim 29 or 30 wherein said cancer is hepatocellular carcinoma or any neoplasm of
liver origin.

32. A method for determining whether an individual is suffering from hepatocellular carcinoma or 
cirrhosis of the liver or is likely to suffer from hepatocellular carcinoma or any neoplasm of liver origin or
cirrhosis of the liver in the future comprising determining whether the level of galectin-3 in the liver of said individual is 
above normal. 

33. The method of Claim 32, wherein the step of determining whether the level of galectin-3
expression in the liver of said individual is above normal comprises determining whether the level of galectin-3 protein
in a biological sample obtained from the liver of said individual is above normal.

34. The method of Claim 33, wherein the step of determining whether the level of galectin-3 protein in 
a biological sample obtained from the liver of said individual is above normal comprises: 

contacting said biological sample with an antibody capable of specifically binding to galectin-3; and

determining whether the level of antibody binding to said sample is greater than the level of 
antibody binding observed in samples from individuals who are not suffering from hepatocellular carcinoma 
or cirrhosis of the liver. 

35. The method of Claim 34, wherein the biological sample is a liver biopsy. 

36. The method of Claim 35, wherein the determining step comprises determining whether the level of
antibody binding to hepatocytes in the liver biopsy is greater than the level observed in biopsies from individuals not
suffering from hepatocellular carcinoma or cirrhosis of the liver.

37. The method of Claim 32, wherein the step of determining whether the level of galectin-3
expression in the liver of said individual is above normal comprises determining the level of mRNA encoding galectin-3
present in a biological sample obtained from the liver of said individual.

38. The method of Claim 37, wherein the level of galectin-3 mRNA is determined by a method from the
group consisting of a nucleic acid amplification reaction, a nucleic acid hybridization assay, and an RNAse protection 
assay. 

39. A method for correlating the ratio between the expression level of two or more galectins with a
disease state comprising:

obtaining biological samples from individuals suffering from said disease state and normal
individuals;

measuring the expression levels of said two or more galectins in said biological samples;

calculating a ratio between said expression levels in said samples; and

determining whether there is a statistically significant association of said ratio with said
disease state.

40. The method of Claim 39, wherein said galectins are selected from the group consisting of galectin-1,
galectin-2, galectin-3, galectin-4, galectin-5, galectin 5H, galectin-7, galectin-8, galectin-9, and galectin-10.

41. A method of determining whether an individual is suffering from or is likely to suffer from a disease
in the future comprising:

obtaining a biological sample from said individual; 

determining the expression levels of at least two galectins in said biological sample; 

calculating the ratio between the expression levels of said galectins; and 

determining whether said ratio is a ratio which is statistically correlated with a disease or a
predisposition to a disease.

42. The method of Claim 41, wherein said galectins are selected from the group consisting of galectin-1,
galectin-2, galectin-3, galectin-4, galectin-5, galectin 5H, galectin-7, galectin-8, galectin-9, and galectin-10.

43. A kit for determining galectin expression in hepatocellular cells comprising: 
hepatocellular cells; and 

a probe indicative of galectin expression in hepatocellular cells.

44. A reaction mixture comprising: 
hepatocellular cells of a patient; and 

a probe indicative of galectin expression in hepatocellular cells. 

45. The method of Claim 10, 17, 29, 30, or 41, wherein the biological sample is a body fluid.

46. A method of determining prognosis of cirrhosis or neoplasm originating from the liver comprising:

(a) obtaining a biological sample; 

(b) measuring the level of expression of two or more galectins; 

(c) calculating a ratio based on the level of expression of the galectins in step (b); and

(d) determining the prognosis of cirrhosis or neoplasm by identifying a ratio calculated in
step (c) which is above normal levels of galectin expression.

47. The method of Claim 46, wherein the galectins measured in step (b) are at least two members of 
the group consisting of galectin-1, galectin-2, galectin-3, galectin-4, galectin-5, galectin 5H, galectin-7, galectin-8, 
galectin-9, and galectin-10.
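
The ratio calculation and statistical association recited in Claims 39, 41, and 46 can be illustrated with a minimal sketch. It is offered only as one possible reading: the claims do not name a statistical test, so the permutation test used here is an assumption, the function names `expression_ratio` and `permutation_p_value` are hypothetical, and the expression values are made-up numbers rather than measured data.

```python
import random
from statistics import mean


def expression_ratio(level_a, level_b):
    """Ratio of one galectin's expression level to another's (hypothetical helper)."""
    if level_b <= 0:
        raise ValueError("reference expression level must be positive")
    return level_a / level_b


def permutation_p_value(disease_ratios, normal_ratios, n_permutations=10000, seed=0):
    """Two-sided permutation test on the difference of mean ratios.

    The choice of test is an assumption; the claims only require a
    statistically significant association.
    """
    rng = random.Random(seed)
    observed = abs(mean(disease_ratios) - mean(normal_ratios))
    pooled = list(disease_ratios) + list(normal_ratios)
    n_disease = len(disease_ratios)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:n_disease]) - mean(pooled[n_disease:]))
        if diff >= observed:
            hits += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return (hits + 1) / (n_permutations + 1)


# Made-up (galectin-3 level, galectin-1 level) pairs, one pair per sample.
disease_samples = [(8.0, 2.0), (9.0, 3.0), (10.0, 2.5), (7.5, 2.5), (12.0, 3.0)]
normal_samples = [(2.0, 2.0), (1.5, 3.0), (2.5, 2.5), (1.0, 2.0), (3.0, 3.0)]

disease_ratios = [expression_ratio(a, b) for a, b in disease_samples]
normal_ratios = [expression_ratio(a, b) for a, b in normal_samples]
p_value = permutation_p_value(disease_ratios, normal_ratios)
print(disease_ratios, normal_ratios, p_value)
```

A ratio from a new sample could then be compared against the ratios of the normal group; the cutoff for "above normal" is not specified in the claims and would have to be chosen separately.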
FIGURE 1
Human galectin-3



1 CCAGCCAACG AGCGGAAAAT GGCAGACAAT TTTTCGCTCC ATGATGCGTT
51 ATCTGGGTCT GGAAACCCAA ACCCTCAAGG ATGGCCTGGC GCATGGGGGA
101 ACCAGCCTGC TGGGGCAGGG GGCTACCCAG GGGCTTCCTA TCCTGGGGCC
151 TACCCCGGGC AGGCACCCCC AGGGGCTTAT CCTGGACAGG CACCTCCAGG
201 CGCCTACCAT GGAGCACCTG GAGCTTATCC CGGAGCACCT GCACCTGGAG
251 TCTACCCAGG GCCACCCAGC GGCCCTGGGG CCTACCCATC TTCTGGACAG
301 CCAAGTGCCC CCGGAGCCTA CCCTGCCACT GGCCCCTATG GCGCCCCTGC
351 TGGGCCACTG ATTGTGCCTT ATAACCTGCC TTTGCCTGGG GGAGTGGTGC
401 CTCGCATGCT GATAACAATT CTGGGCACGG TGAAGCCCAA TGCAAACAGA
451 ATTGCTTTAG ATTTCCAAAG AGGGAATGAT GTTGCCTTCC ACTTTAACCC
501 ACGCTTCAAT GAGAACAACA GGAGAGTCAT TGTTTGCAAT ACAAAGCTGG
551 ATAATAACTG GGGAAGGGAA GAAAGACAGT CGGTTTTCCC ATTTGAAAGT
601 GGGAAACCAT TCAAAATACA AGTACTGGTT GAACCTGACC ACTTCAAGGT
651 TGCAGTGAAT GATGCTCACT TGTTGCAGTA CAATCATCGG GTTAAAAAAC
701 TCAATGAAAT CAGCAAACTG GGAATTTCTG GTGACATAGA CCTCACCAGT
751 GCTTCATATA CCATGATATA ATCTGAAAGG GGCAGATTAA AAAAAAAAAA
801 AAAGAATCTA AACCTTACAT GTGTAAAGGT TTCATGTTCA CTGTGAGTGA
851 AAATTTTTAC ATTCATCAAT ATCCCTCTTG TAAGTCATCT ACTTAATAAA
901 TATTACAGTG AAAG (SEQ. ID. NO. 47)
SEQUENCE LISTING



<110> HSU, Daniel, K. 
LIU, Fu-Tong 

DOWLING, Christopher, A. 



<120> GALECTIN EXPRESSION IS INDUCED IN
CIRRHOTIC LIVER AND HEPATOCELLULAR CARCINOMA

<130> DANHSU.001VPC 

<140> 60/129,111 
<141> 1999-04-13 

<160> 47 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 50 
<212> PRT 
<213> chicken 



SU°Sli Ala Met Lys Ala Arg Cys Trp Gin Pro His Trp Met Leu Pro 

1 5 10 15
Leu Leu Pro Leu Ser Ser Pro Leu His Pro Gln Leu Ser Asp Ala Leu
20 25 30
Pro Ala His Asn Pro Gly Ala Pro Pro Pro Gln Gly Trp Asn Arg Pro
35 40 45
Pro Gly
50



<210> 2
<211> 50
<212> PRT
<213> chicken

<400> 2
Pro Gly Ala Phe Pro Ala Tyr Pro Gly Tyr Pro Gly Ala Tyr Pro Gly
1 5 10 15
Ala Pro Gly Pro Tyr Pro Gly Ala Pro Gly Pro His His Gly Pro Pro
20 25 30
Gly Pro Tyr Pro Gly Gly Pro Pro Gly Pro Tyr Pro Gly Gly Pro Pro
35 40 45
Gly Pro
50



<210> 3 
<211> 27 
<212> PRT 
<213> nematode 



<400> 3
Met Ser Ala Glu Glu Pro Lys Ser Tyr Pro Val Pro Tyr Arg Ser Val
1 5 10 15
Leu Gln Glu Lys Phe Glu Pro Gly Gln Thr Leu
20 25

<210> 4
<211> 17
<212> PRT
<213> eel

<400> 4
Ser Gly Gly Leu Gln Val Lys Asn Phe Asp Phe Thr Val Gly Lys Phe
1 5 10 15
Leu



<210> 5 
<211> 43 
<212> PRT 
<213> chicken 

<400> 5 

Tyr Pro Gly Gly Pro Pro Gly Pro Tyr Pro Gly Gly Pro Thr Ala Pro
1 5 10 15
Tyr Ser Glu Ala Pro Ala Ala Pro Leu Lys Val Pro Tyr Asp Leu Pro
20 25 30
Leu Pro Ala Gly Leu Met Pro Arg Leu Leu Ile
35 40



<210> 6 
<211> 33 
<212> PRT 
<213> rat 

<400> 6 

Met Ala Tyr Val Pro Ala Pro Gly Tyr Gln Pro Thr Tyr Asn Pro Thr
1 5 10 15
Leu Pro Tyr Lys Arg Pro Ile Pro Gly Gly Leu Ser Val Gly Met Ser
20 25 30
Ile



<210> 7
<211> 12
<212> PRT
<213> mouse

<400> 7
Pro Ile Pro Gly Gly Leu Ser Val Gly Met Ser Val
1 5 10



<210> 8
<211> 18
<212> PRT
<213> human

<400> 8
Met Ala Cys Gly Leu Val Ala Ser Asn Leu Asn Leu Lys Pro Gly Glu
1 5 10 15
Cys Leu

<210> 9
<211> 33
<212> PRT
<213> human

<400> 9
Met Ala Tyr Val Pro Ala Pro Gly Tyr Gln Pro Thr Tyr Asn Pro Thr
1 5 10 15
Leu Pro Tyr Tyr Gln Pro Ile Pro Gly Gly Leu Asn Val Gly Met Ser
20 25 30
Val



<210> 10 
<211> 42 
<212> PRT 
<213> nematode 



^e°?ai°Lys Gly Ser Thr He Asp Glu.Ser Gin Arg Phe Thr lie Asn 

1 5 10 15
Leu His Ser Lys Thr Ala Asp Phe Ser Gly Asn Asp Val Pro Leu His
20 25 30
Val Ser Val Arg Phe Asp Glu Gly Lys Ile
35 40



<210> 11 
<211> 41 
<212> PRT 
<213> eel 

<400> 11 



Thr Val Gly Gly Phe Ile Asn Asn Ser Pro Gln Arg Phe Ser Val Asn
1 5 10 15
Val Gly Glu Ser Met Asn Ser Leu Ser Leu His Leu Asp His Arg Phe
20 25 30
Asn Tyr Gly Ala Asp Gln Asn Thr Ile
35 40



<210> 12 
<211> 39 
<212> PRT 
<213> chicken 

^n^Il^Thr Gly Thr Val Asn Ser Asn Pro Asn Arg Phe Ser Leu Asp 

1 5 10 15
Phe Lys Arg Gly Gln Asp Ile Ala Phe His Phe Asn Pro Arg Phe Lys
20 25 30
Glu Asp His Lys Arg Val Ile
35



<210> 13
<211> 41
<212> PRT
<213> rat

<400> 13
Tyr Ile Gln Gly Ile Ala Lys Asp Asn Met Arg Arg Phe His Val Asn
1 5 10 15
Phe Ala Val Gly Gln Asp Glu Gly Ala Asp Ile Ala Phe His Phe Asn
20 25 30
Pro Arg Phe Asp Gly Trp Asp Lys Val
35 40



<210> 14 
<211> 41 
<212> PRT 
<213> mouse 

<400> 14
Tyr Ile Gln Gly Met Ala Lys Glu Asn Met Arg Arg Phe His Val Asn
1 5 10 15
Phe Ala Val Gly Gln Asp Asp Gly Ala Asp Val Ala Phe His Phe Asn
20 25 30
Pro Arg Phe Asp Gly Trp Asp Lys Val
35 40



<210> 15 
<211> 41 
<212> PRT 
<213> human 

<400> 15
Arg Val Arg Gly Glu Val Ala Pro Asp Ala Lys Ser Phe Val Leu Asn
1 5 10 15
Leu Gly Lys Asp Ser Asn Asn Leu Cys Leu His Phe Asn Pro Arg Phe
20 25 30
Asn Ala His Gly Asp Ala Asn Thr Ile
35 40



<210> 16 
<211> 41 
<212> PRT 
<213> human 

<400> 16
Tyr Ile Gln Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val Asn
1 5 10 15
Phe Val Val Gly Gln Asp Pro Gly Ser Asp Val Ala Phe His Phe Asn
20 25 30
Pro Arg Phe Asp Gly Trp Asp Lys Val
35 40



<210> 17 
<211> 44 
<212> PRT 
<213> nematode 

vli°Lei 7 Asn Ser Phe Ser Asn Gly Glu Trp Gly Lys Glu Glu Arg Lys 
1 5 10 15
Ser Asn Pro Ile Lys Lys Gly Asp Ser Phe Asp Ile Arg Ile Arg Ala
20 25 30
His Asp Asp Arg Phe Gln Ile Ile Val Asp His Lys
35 40



<210> 18 
<211> 48 
<212> PRT 
<213> eel 



<400> 18
Val Met Asn Ser Thr Leu Lys Gly Asp Asn Gly Trp Glu Thr Glu Gln
1 5 10 15
Arg Ser Thr Asn Phe Thr Leu Ser Ala Gly Gln Tyr Phe Glu Ile Thr
20 25 30
Leu Ser Tyr Asp Ile Asn Lys Phe Tyr Ile Asp Ile Leu Asp Gly Pro
35 40 45



<210> 19 
<211> 46
<212> PRT 
<213> chicken 



5a?°Cyi 9 A sn Ser Met Phe Gin Asn Asn Trp Gly Lys Glu Glu Arg Thr 

1 5 10 15
Ala Pro Arg Phe Pro Phe Glu Pro Gly Thr Pro Phe Lys Leu Gln Val
20 25 30
Leu Cys Glu Gly Asp His Phe Lys Val Ala Val Asn Asp Ala
35 40 45



<210> 20 
<211> 45 
<212> PRT 
<213> rat 

<400> 20
Val Phe Asn Thr Met Gln Ser Gly Gln Trp Gly Lys Glu Glu Lys Lys
1 5 10 15
Lys Ser Met Pro Phe Gln Lys Gly His His Phe Glu Leu Val Phe Met
20 25 30
Val Met Ser Glu His Tyr Lys Val Val Val Asn Gly Thr
35 40 45



<210> 21 
<211> 45 
<212> PRT 
<213> mouse 



Val^Lys Thr Met Gin Ser Gly Gin Trp Gly Lys Glu Glu Lys Lys 

1 5 10 15
Lys Ser Met Pro Phe Gln Lys Gly Lys His Phe Glu Leu Val Phe Met
20 25 30
Val Met Pro Glu His Tyr Lys Val Val Val Asn Gly Asn
35 40 45

<210> 22
<211> 46 
<212> PRT 
<213> human 

<400> 22 



Val Cys Asn Ser Lys Asp Gly Gly Ala Trp Gly Thr Glu Gln Arg Glu
1 5 10 15
Ala Val Phe Pro Phe Gln Pro Gly Ser Val Ala Glu Val Cys Ile Thr
20 25 30
Phe Asp Gln Ala Asn Leu Thr Val Lys Leu Pro Asp Gly Tyr
35 40 45



<210> 23 
<211> 45 
<212> PRT 
<213> human 

<400> 23
Val Phe Asn Thr Leu Gln Gly Gly Lys Trp Gly Ser Glu Glu Arg Lys
1 5 10 15
Arg Ser Met Pro Phe Lys Lys Gly Ala Ala Phe Glu Leu Val Phe Ile
20 25 30
Val Met Ala Glu His Tyr Lys Val Val Val Asn Gly Asn
35 40 45



<210> 24 
<211> 34 
<212> PRT 
<213> nematode 

G^Ph^Lys Asp Tyr Glu His Arg Leu Pro Leu Ser Ser He Ser His 

1 5 10 15
Leu Ser Ile Asp Gly Asp Leu Tyr Leu Asn His Val His Trp Gly Gly
20 25 30
Lys Tyr



<210> 25 
<211> 29 
<212> PRT 
<213> eel 

isn°Lu 5 Glu Phe Pro Asn Arg Tyr Ser Lys Glu Phe Leu Pro Phe Leu 

1 5 10 15

Ser Leu Ala Gly Asp Ala Arg Leu Thr Leu Val Lys Glu 
20 25 



<210> 26
<211> 34
<212> PRT
<213> chicken

<400> 26
His Leu Leu Gln Phe Asn Phe Arg Glu Lys Lys Leu Asn Gly Ile Thr
1 5 10 15
Lys Leu Cys Ile Ala Gly Asp Ile Thr Leu Thr Ser Val Leu Thr Ser
20 25 30
Met Ile



<210> 27 
<211> 47 
<212> PRT 
<213> rat 



<400> 27
Pro Phe Tyr Glu Tyr Gly His Arg Leu Pro Leu Gln Met Val Thr His
1 5 10 15
Leu Gln Val Asp Gly Asp Leu Glu Leu Gln Ser Ile Asn Phe Leu Gly
20 25 30
Gly Gln Pro Ala Ala Ser Gln Tyr Pro Gly Thr Met Thr Ile Pro
35 40 45



<210> 28 
<211> 47 
<212> PRT 
<213> mouse 



<400> 28
Ser Phe Tyr Glu Tyr Gly His Arg Leu Pro Val Gln Met Val Thr His
1 5 10 15
Leu Gln Val Asp Gly Asp Leu Glu Leu Gln Ser Ile Asn Phe Leu Gly
20 25 30
Gly Gln Pro Ala Ala Ala Pro Tyr Ala Gly Ala Met Thr Ile Pro
35 40 45



<210> 29 
<211> 30 
<212> PRT 
<213> human 

<400> 29
Glu Phe Lys Phe Pro Asn Arg Leu Asn Leu Glu Ala Ile Asn Tyr Met
1 5 10 15
Ala Ala Asp Gly Asp Phe Lys Ile Lys Cys Val Ala Phe Asp
20 25 30



<210> 30 
<211> 45 
<212> PRT 
<213> human 

<400> 30
Pro Phe Tyr Glu Tyr Gly His Arg Leu Pro Leu Gln Met Val Thr His
1 5 10 15
Leu Gln Val Asp Gly Asp Leu Gln Leu Gln Ser Ile Asn Phe Ile Gly
20 25 30
Gly Gln Pro Leu Arg Pro Gln Gly Pro Pro Met Met Pro
35 40 45

<210> 31
<211> 23
<212> PRT
<213> nematode

T^Pr^Val Pro Tyr Glu Ser Gly Leu Ala Asn Gly Leu Pro Val Gly, 
1 5 10 15



Lys Ser Leu Leu Val Phe Gly 

20 



<210> 32 
<211> 47 
<212> PRT 
<213> rat 



<400> 32
Ala Tyr Pro Ser Ala Gly Tyr Asn Pro Gln Met Asn Ser Leu Pro Val
1 5 10 15
Met Ala Gly Pro Pro Ile Phe Asn Pro Pro Val Pro Tyr Val Gly Thr
20 25 30
Leu Gln Gly Gly Leu Thr Ala Arg Arg Thr Ile Ile Ile Lys Gly
35 40 45



<210> 33 
<211> 50 
<212> PRT 
<213> mouse 

<400> 33
Ala Tyr Pro Ala Gly Ser Pro Gly Tyr Asn Pro Pro Gln Met Asn Thr
1 5 10 15
Leu Pro Val Met Thr Gly Pro Pro Val Phe Asn Pro Arg Val Pro Tyr
20 25 30
Val Gly Ala Leu Gln Gly Gly Leu Thr Leu Pro Arg Thr Ile Ile Ile
35 40 45
Lys Gly
50



<210> 34 
<211> 47 
<212> PRT 
<213> human 

<400> 34
Pro Tyr Pro Gly Pro Gly His Cys His Gln Gln Leu Asn Ser Leu Pro
1 5 10 15
Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Val Pro Tyr Phe Gly Arg
20 25 30
Leu Gln Gly Gly Leu Thr Ala Arg Arg Thr Ile Ile Ile Lys Gly
35 40 45



<210> 35
<211> 49
<212> PRT
<213> nematode

<400> 35
Thr Val Glu Lys Lys Ala Lys Arg Phe His Val Asn Leu Leu Arg Lys
1 5 10 15
Asn Gly Asp Ile Ser Phe His Phe Asn Pro Arg Phe Asp Glu Lys His
20 25 30
Val Ile Arg Asn Ser Leu Ala Ala Asn Glu Trp Gly Asn Glu Glu Arg
35 40 45
Glu



<210> 36 
<211> 49 
<212> PRT 
<213> rat 

<400> 36
Tyr Val Leu Pro Thr Ala Lys Asn Leu Ile Ile Asn Phe Lys Val Gly
1 5 10 15
Ser Thr Gly Asp Ile Ala Phe His Met Asn Pro Arg Ile Gly Asp Cys
20 25 30
Val Val Arg Asn Ser Tyr Met Asn Gly Ser Trp Gly Ser Glu Glu Arg
35 40 45
Lys



<210> 37 
<211> 49 
<212> PRT 
<213> mouse 

<400> 37
Tyr Val Leu Pro Thr Ala Arg Asn Phe Val Ile Asn Phe Lys Val Gly
1 5 10 15
Ser Ser Gly Asp Ile Ala Leu His Leu Asn Pro Arg Ile Gly Asp Ser
20 25 30
Val Val Arg Asn Ser Phe Met Asn Gly Ser Trp Gly Ala Glu Glu Arg
35 40 45
Lys



<210> 38
<211> 49
<212> PRT
<213> human

<400> 38
Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala Ile Asn Phe Lys Val Gly
1 5 10 15
Ser Ser Gly Asp Ile Ala Leu His Ile Asn Pro His Gly Asn Gly Thr
20 25 30
Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp Gly Ser Glu Glu Lys
35 40 45
Lys

<210> 39
<211> 48
<212> PRT
<213> nematode

<400> 39
Ile Pro Tyr Asn Pro Phe Gly Ala Gly Gln Phe Phe Asp Leu Ser Ile
1 5 10 15
Arg Cys Gly Thr Asp Arg Phe Lys Val Phe Ala Asn Gly Gln His Leu
20 25 30
Phe Asp Phe Ser His Arg Phe Gln Ala Phe Gln Arg Val Asp Met Leu
35 40 45



<210> 40 
<211> 48 
<212> PRT 
<213> rat 



<400> 40 

Val Ala Tyr Asn Pro Phe Gly Pro Gly Gln Phe Phe Asp Leu Ser Ile
1 5 10 15
Arg Cys Gly Met Asp Arg Phe Lys Val Phe Ala Asn Gly Gln His Leu
20 25 30
Phe Asp Phe Ser His Arg Phe Gln Ala Phe Gln Met Val Asp Thr Leu
35 40 45



<210> 41 
<211> 48 
<212> PRT 
<213> mouse 

<400> 41
Thr Thr His Asn Pro Phe Gly Pro Gly Gln Phe Phe Asp Leu Ser Ile
1 5 10 15
Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr Ala Asn Gly Gln His Leu
20 25 30
Phe Asp Phe Ala His Pro Ser Arg Ala Phe Gln Arg Val Asp Thr Leu
35 40 45



<210> 42 
<211> 48 
<212> PRT 
<213> human 



<400> 42 

Thr Thr His Asn Pro Phe Gly Pro Gly Gln Phe Phe Asp Leu Ser Ile
1 5 10 15
Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr Ala Asn Gly Gln His Leu
20 25 30
Phe Asp Phe Ala His Pro Ser Arg Ala Phe Gln Arg Val Asp Thr Leu
35 40 45



<210> 43
<211> 14
<212> PRT
<213> nematode

<400> 43
Gln Ile Ser Gly Asp Ile Glu Leu Ser Gly Ile Gln Ile Gln
1 5 10



<210> 44 
<211> 13 
<212> PRT 
<213> rat 



<400> 44
Glu Ile Lys Gly Asp Ile Thr Leu Ser Tyr Val Gln Ile
1 5 10

<210> 45
<211> 8
<212> PRT
<213> mouse

<400> 45
Glu Ile Asn Gly Asp Ile Thr Leu
1 5



<210> 46
<211> 13
<212> PRT
<213> human

<400> 46
Glu Ile Gln Gly Asp Val Thr Leu Ser Tyr Val Gln Ile
1 5 10



<210> 47 
<211> 914 
<212> PRT 
<213> human 

<400> 47
Cys Cys Ala Gly Cys Cys Ala Ala Cys Gly Ala Gly Cys Gly Gly Ala
1 5 10 15
Ala Ala Ala Thr Gly Gly Cys Ala Gly Ala Cys Ala Ala Thr Thr Thr
20 25 30
Thr Thr Cys Gly Cys Thr Cys Cys Ala Thr Gly Ala Thr Gly Cys Gly
35 40 45
Thr Thr Ala Thr Cys Thr Gly Gly Gly Thr Cys Thr Gly Gly Ala Ala
50 55 60
Ala Cys Cys Cys Ala Ala Ala Cys Cys Cys Thr Cys Ala Ala Gly Gly
65 70 75 80
Ala Thr Gly Gly Cys Cys Thr Gly Gly Cys Gly Cys Ala Thr Gly Gly
85 90 95
Gly Gly Gly Ala Ala Cys Cys Ala Gly Cys Cys Thr Gly Cys Thr Gly
100 105 110
Gly Gly Gly Cys Ala Gly Gly Gly Gly Gly Cys Thr Ala Cys Cys Cys
115 120 125
Ala Gly Gly Gly Gly Cys Thr Thr Cys Cys Thr Ala Thr Cys Cys Thr
130 135 140
Gly Gly Gly Gly Cys Cys Thr Ala Cys Cys Cys Cys Gly Gly Gly Cys
145 150 155 160
Ala Gly Gly Cys Ala Cys Cys Cys Cys Cys Ala Gly Gly Gly Gly Cys

165 170 175

Thr Thr Ala Thr Cys Cys Thr Gly Gly Ala Cys Ala Gly Gly Cys Ala 

180 185 190 

Cys Cys Thr Cys Cys Ala Gly Gly Cys Gly Cys Cys Thr Ala Cys Cys

195 200 205 

Ala Thr Gly Gly Ala Gly Cys Ala Cys Cys Thr Gly Gly Ala Gly Cys 

210 215 220

Thr Thr Ala Thr Cys Cys Cys Gly Gly Ala Gly Cys Ala Cys Cys Thr 
225 230 235 240 

Gly Cys Ala Cys Cys Thr Gly Gly Ala Gly Thr Cys Thr Ala Cys Cys

245 250 255 

Cys Ala Gly Gly Gly Cys Cys Ala Cys Cys Cys Ala Gly Cys Gly Gly

260 265 270

Cys Cys Cys Thr Gly Gly Gly Gly Cys Cys Thr Ala Cys Cys Cys Ala

275 280 285

Thr Cys Thr Thr Cys Thr Gly Gly Ala Cys Ala Gly Cys Cys Ala Ala 

290 295 300 

Gly Thr Gly Cys Cys Cys Cys Cys Gly Gly Ala Gly Cys Cys Thr Ala 
305 310 315 320 

Cys Cys Cys Thr Gly Cys Cys Ala Cys Thr Gly Gly Cys Cys Cys Cys 

325 330 335 

Thr Ala Thr Gly Gly Cys Gly Cys Cys Cys Cys Thr Gly Cys Thr Gly 

340 345 350 

Gly Gly Cys Cys Ala Cys Thr Gly Ala Thr Thr Gly Thr Gly Cys Cys 

355 360 365 

Thr Thr Ala Thr Ala Ala Cys Cys Thr Gly Cys Cys Thr Thr Thr Gly 

370 375 380 

Cys Cys Thr Gly Gly Gly Gly Gly Ala Gly Thr Gly Gly Thr Gly Cys
385 390 395 400 

Cys Thr Cys Gly Cys Ala Thr Gly Cys Thr Gly Ala Thr Ala Ala Cys 

405 410 415 

Ala Ala Thr Thr Cys Thr Gly Gly Gly Cys Ala Cys Gly Gly Thr Gly 

420 425 430 

Ala Ala Gly Cys Cys Cys Ala Ala Thr Gly Cys Ala Ala Ala Cys Ala 

435 440 445 

Gly Ala Ala Thr Thr Gly Cys Thr Thr Thr Ala Gly Ala Thr Thr Thr 

450 455 460 

Cys Cys Ala Ala Ala Gly Ala Gly Gly Gly Ala Ala Thr Gly Ala Thr 
465 470 475 480 

Gly Thr Thr Gly Cys Cys Thr Thr Cys Cys Ala Cys Thr Thr Thr Ala 

485 490 495 

Ala Cys Cys Cys Ala Cys Gly Cys Thr Thr Cys Ala Ala Thr Gly Ala 

500 505 510 

Gly Ala Ala Cys Ala Ala Cys Ala Gly Gly Ala Gly Ala Gly Thr Cys 

515 520 525 

Ala Thr Thr Gly Thr Thr Thr Gly Cys Ala Ala Thr Ala Cys Ala Ala 

530 535 540

Ala Gly Cys Thr Gly Gly Ala Thr Ala Ala Thr Ala Ala Cys Thr Gly 
545 550 555 560 

Gly Gly Gly Ala Ala Gly Gly Gly Ala Ala Gly Ala Ala Ala Gly Ala 

565 570 575 

Cys Ala Gly Thr Cys Gly Gly Thr Thr Thr Thr Cys Cys Cys Ala Thr

580 585 590 

Thr Thr Gly Ala Ala Ala Gly Thr Gly Gly Gly Ala Ala Ala Cys Cys 

595 600 605 

Ala Thr Thr Cys Ala Ala Ala Ala Thr Ala Cys Ala Ala Gly Thr Ala 

610 615 620

Cys Thr Gly Gly Thr Thr Gly Ala Ala Cys Cys Thr Gly Ala Cys Cys 
625 630 635 640 

Ala Cys Thr Thr Cys Ala Ala Gly Gly Thr Thr Gly Cys Ala Gly Thr

645 650 655

Gly Ala Ala Thr Gly Ala Thr Gly Cys Thr Cys Ala Cys Thr Thr Gly

660 665 670 

Thr Thr Gly Cys Ala Gly Thr Ala Cys Ala Ala Thr Cys Ala Thr Cys 

675 680 685 

Gly Gly Gly Thr Thr Ala Ala Ala Ala Ala Ala Cys Thr Cys Ala Ala

690 695 700
Thr Gly Ala Ala Ala Thr Cys Ala Gly Cys Ala Ala Ala Cys Thr Gly 

705 710 715 720 

Gly Gly Ala Ala Thr Thr Thr Cys Thr Gly Gly Thr Gly Ala Cys Ala 

725 730 735 

Thr Ala Gly Ala Cys Cys Thr Cys Ala Cys Cys Ala Gly Thr Gly Cys 

740 745 750 

Thr Thr Cys Ala Thr Ala Thr Ala Cys Cys Ala Thr Gly Ala Thr Ala 

755 760 765 

Thr Ala Ala Thr Cys Thr Gly Ala Ala Ala Gly Gly Gly Gly Cys Ala 

770 775 780 

Gly Ala Thr Thr Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala
785 790 795 800

Ala Ala Ala Gly Ala Ala Thr Cys Thr Ala Ala Ala Cys Cys Thr Thr 

805 810 815 

Ala Cys Ala Thr Gly Thr Gly Thr Ala Ala Ala Gly Gly Thr Thr Thr 

820 825 830 

Cys Ala Thr Gly Thr Thr Cys Ala Cys Thr Gly Thr Gly Ala Gly Thr

835 840 845 

Gly Ala Ala Ala Ala Thr Thr Thr Thr Thr Ala Cys Ala Thr Thr Cys 

850 855 860 

Ala Thr Cys Ala Ala Thr Ala Thr Cys Cys Cys Thr Cys Thr Thr Gly
865 870 875 880 

Thr Ala Ala Gly Thr Cys Ala Thr Cys Thr Ala Cys Thr Thr Ala Ala 

885 890 895 

Thr Ala Ala Ala Thr Ala Thr Thr Ala Cys Ala Gly Thr Gly Ala Ala 
900 905 910 

Ala Gly